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THE SEVERE ACUTE RESPIRATORY SYNDROME CORONAVIRUS 

All documents dted herein are incorporated by lefraence in their entirety. 

RELATED APPLICATIONS, EHOM WHICH PRIORITY IS CLAIMED 

T his application incorporates by reference in its entirety US provisional patent application 
60/462,218, Attorney Reference No. PP20474.001, filed on April 10, 2003 via Express Mail with 
the US post office, US provisional patent application 60/462,465, Attorney Reference No. 
PP20480.001, filed on April 11, 2003 via Express Mail with the US post office, US provisional 
patent application 60/462,418, Attorney Reference No. PP20480.002, filed on April 12, 2003 via 
Express Mail with the US post office, US provisional patent application 60/462,748, Attorney 
Reference No. PP20480.003, filed on April 13, 2003 via Express Mail with the US post office, 
US provisional patent application 60/463,109, Attorney Reference No. PP20480.004, filed on 
April 14, 2003 via Express Mail with the US post office, US provisional patent application 
60/463,460, Attorney Reference No. PP20480.005, filed on April 15, 2003 via Express Mail with 
the US post office, US provisional patent replication 60/463,668, Attorney Reference No. 
PP20480.006, filed on April 16, 2003 via Express Mail with the US post office, US provisional 
patent application 60/463,983, Attorney Reference No. PP20480.007, filed on April 17, 2003 via 
Express Mail with the US post office, US provisional patent application 60/463 ,97 1 , Attorney 
Reference No. PP20480.008, filed on April 18, 2003 via Express Mail with the US post office, 
US provisional patent application 60/464,899, Attorney Reference No. PP20480.009, filed on 
April 22, 2003 via Express Mail with the US post office, US provisional patent application 
60/464,838, Attorney Reference No. PP20507.001, filed on April 22, 2003 via Express Mail with 
the US post office, US provisional patent application 60/465,273, Attorney Reference No. 
PP20518.001, filed on April 23, 2003 via Express Mail with the US post office, US provisional 
patent ^>plication 60/465,535, Attorney Reference No. PP205 18.002, filed on April 24, 2003 via 
Express Mail with the US post office, US provisional patent application 60/468312, Attorney 
Reference No. PP20480.010, filed on May 5, 2003 via Express Mail with the US post office, and 
US provisional patent application 60/473,144, Attorney Reference No. PP20480.01 1, filed on 
May 22, 2003, US provisional patent application 60/495,024, Attorney Reference No. 
PP20480.012, filed on August 14, 2003 via Express Mail with the US post office, US provisional 
patent application 60/505,652, Attorney Reference No. PP20480.013, filed on September 23, 
2003 via Express Mail with the US post office, US provisional patent application 60/510,781, 

Attorney Reference No. PP20480.014, filed on October 11, 2003 via Express Mail with the US 
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post office, US provisional patent appUcation 60/529,464, Attorney Reference No. PP20480.015, 
filed on December 11, 2003 via Express Mail witii the US post office, US provisional patent 
appHcalion 60/536,177, Attorney Reference No. PP20480.016, filed on January 12, 2004 via 

Express Mail with the US post office, and US provisional patent application 60/ — , , 

5 Attorney Reference No. PP20480.017, filed on April 7, 2004 via Express Mail with the US post 
office. 

FIELD OF THE INVENTION 

The invention relates to nucleic acids and proteins firem Severe Acute Respiratory 
Syndrome (SARS) Virus. These nucleic acids and proteins can be used in the preparation and 
10 manufacture of vaccine formulations for the treatment or prevention of SARS. The invention 

also relates to diagnostic reagents, kits (comprising such reagents) and methods which can be 
used to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
inv^don also relates to methods for die treatment or prevention of SARS utilizing small 
molecule viral inhibitors and combinations of small molecule viral inhibitors and kits for the 
15 treament of SARS. 

BACKGROUND OF THE INVENTION 

An outbreak of a virulent respiratory virus, now known as Severe Acute Respiratory 
Syndrome (SARS), was identified in Hong Kong, China and a number of other countries around 
the world in 2003. Patients typically had symptoms including fever, dry cough, dyspnea, 

20 headache, and hypoxemia. Isolates of the SARS virus appear to have homology with at least the 
RNA polymerase gene of several known coronaviruses. A phylogenetic analysis of this 
homology is presented in Peiris et al., “Coronavirus as a possible cause of severe acute 
respiratory syndrome”. Lancet^ published online April 8, 2003 at 

http:/Amage.aielanceLcojn/extras/03art3477web.p^, incorporated herein by reference in its entirety. 

25 Other sequenced fragments of the SARS virus genome appear to overlap with the open reading 

fiame lb of coronaviruses. See, Drosteo et cd., “Identification of a Novel Coronavirus in 
Patients with Severe Acute Respiratory Syndrome”, New England Journal of Medicine, 
published online at htq>://www.nejm.org on April 10, 2003, incorporated harein by reference in its 
entirety. 

30 The Genome Sciaice Center in British Colombia, Canada published on its website 

Qittp://www.hcgsc.ca/bioitfo/SARS/) a draft g^ome assembly of 29fl36 base pairs of a virus believed 
to be a SARS virus, referred to as the TOR2 isolate. This draft genome assembly is given herein 
asSEQIDNO: 1. 

llie Centers for Disease Control (CDQ published a nucleotide sequence of a SARS-CoV 
35 . stmn(,SWi1DNO:2)<mitswtb^tsiht^:/Mww.cdc.gov/ncidod/sars^><^^leoseq.p^. TheCDC 
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has also published a phylogenetic tree of the predicted N, S and M proteins (attached as FIGURE 
6). Hiis tree places the SAKS virus outside any of die previously known coronaviius groups. 

There is a growing need for prophylactic or therapeutic vaccines against the SAKS virus as 
weU as diagnostic and screening methods and compositions to identify the presence of the virus 
5 in, e.g., mammalian tissue or serum. 

SUMMARY OF THE INVENTION 

The invention relates to nucleic acids and proteins from Severe Acute Respiratory 
Syndrome (SARS) virus. These nucleic adds and proteins can be used in the preparation and 
manu^ture of vaccine formulations for the treatment or prevention of SARS. Such vaccine 
10 formulations may include an inactivated (or killed) SARS virus, an attenuated SARS virus, a 
split SARS virus preparation and a recombinant or purified subunit formulation of one or more 
SARS viral antigens. Expression and delivery of the polynucleotides of the invention may be 
facilitated via viral vectors and/or viral particles. 

Uie invention also relates to diagnostic reagents, kits (comprising such reagents) and 
15 methods which can be used to diagnose or identify the presence or absence of a SARS virus in a 
biological sample. The invention further includes non-coding SARS viral polynucleotide 
sequences, SARS viral sequences encoding for non-immunogenic proteins, conserved and 
variant SARS viral polynucleotide sequences for use in such diagnostic compositions and 
methods. 

20 The invention fiirdier relates to vaccine formulations comprising one or more SARS vims 

antigens and one or more other respiratory vims antigens. Additional respiratory virus antigens 
suitable for use in the invention include antigens from influenza vims, human rhinovims (HRV), 
parainfluenza virus (PIV). respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and 
riiinovims. Uie additional respiratory virus antigen could also be from a coronavirus other than 
25 the SARS coronavirus. Preferably, the additional respiratory virus antigen is an influenza viral 
antigen. 

The compositions of the invention may further comprise one or more adjuvants. Adjuvants 
suitable for use in the invention include mucosal, transdermal or parenteral adjuvants. Mucosal 
adjuvants suitable for use in the invention include detoxified bacterial ADP-ribosylating toxins, 
30 such as E. coli heat labile toxoids (e.g . , LTK63), chitosan and derivatives thereof, and non-toxic 
double mutant forms of Bordetella pertussis toxoids. Parenteral adjuvants suitable for use in the 
invention include MF59 and aluminum or aluminum salts. 

The invration also provides methods for treating SARS by administering small molecule 
compounds, as weU as methods of identifying potent small molecules for the treatment of SARS. 
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In one aspect of the invention a method of identif^g a therapeutically active agent is 
provided comprising: (a) contacting the therapeutically active agent with a cell infected with the 
SAKS virus; (b) measuring attenuation of a SARS related mizyme. 

In a mote particular embodiment, the therapeutically active agent is a small molecule. In 
5 another mote particular embodiment, tiie flierq)eutically active agent is a nucleoside analog. In 
another more particular embodiment the thersq)eutically active agent is a peptoid, oligopeptide, 
orpolyp«q)tide. In another embodiment the SARS related enzyme is SARS protease. In another 
embodiment the SARS related enzyme is SARS polymerase. In still another embodiment the 
SARS related enzyme is a kinase. Methods of identifying therapeutically active agents for 
10 treatment of SARS virus infection are further discussed in Section V below. 

In another aspect of the invention a method of treating a human infected with SARS is 
provided comprising admini.«rtering a small molecule to a patient in need thereof. In one 
embodiment the small molecule is an inhibitor of SARS protease. In another embodiment the 
small molecule is an inhibitor of SARS polymerase. In another embodiment the SARS related 
15 enzyme is a kinase. In still another embodiment the small molecule is administered orally or 
parenterally. 

The invention also provides the use of such small molecules in tiie manufacture of a 
medicament for the treatment of severe acute respiratory syndrome. 

Small molecule compounds of the present invention include those of less than 1000 g/mol, 
20 preferably with an aromatic region and greater than one heteroatom selected from O, S, or N. 
Preferred small molecules include, but are not limited to acyclovir, gancyclovir, vidarabidine, 
foscamet, cidofovir, amantidine, ribavirin, trifluorothymidine, zidovudine, didanosine, 
zalcitabine, and combinations thereof. Interferons may also be used for treating patients, 
including interferon-a and interferon-p. Interferon treatment has shown promise in treating 
25 SARS in monkeys (Enserink (2004) Science 303: 1273-1275), particularly when pegylated 
(Haagmans et al. (2(K)4) Nature Medicine 10:290-293). 

One aspect of the present invention relates to methods for identi^ng individuals exposed 
to, and biological samples containing SARS virus (SARSV), and to kits for carrying out the 
methods. Such methods can utilize nucleic acid detection techniques such as PCR, RT-PCR (the 
30 Coronaviridae are RNA viruses), transcription-mediated amplification (TMA), ligase chain 
reaction (LC3R), branched DNA signal amplification assays, isothermal nucleic acid sequence 
based amplification (NASB A), other self-sustained sequence replication assays, boomerang 
DNA amplification, strand-displacement activation, cycling probe technology, or combinations 
of such amplification methods. Such nucleic acid detection techmques utilize oligonucleotides 
35 having nucleotide sequence similar to, or complementary to, the SARS viral genome, as primers 

(e.g., for anq>lification) and as probes (e.g-, for capture or detection), as is well known in the art. 

■A- 
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Alternatively, or in addition to the nucleic acid detection methods described supra, the 
methods of the piesait invention can utilize various immunoassay techniques for detection of 
SARSV antigens and/or antibodies. 

Accordingly, the present invention relates to methods of identifying individuals exposed to 
5 SARSV, or biological samples containing SARSV, by detecting the presence of SARSV 
antigens nsiTi g antibodies which specifically bind to the same. Hie antibodies are preferably 
monoclonal antibodies. Quantification of the amount of viral antigens present in a sample of an 
individual may be used in determining the prognosis of an infected individud. Preferably, the 
SARSV antigens to be detected are generally one of the structural proteins, particularly those 
10 present on the surface of the viral particles and include, for example, the spike glycoprotein (S), 

also called E2; the envelope (small membrane) protein (E), also called sM; the membrane 
glycoprotein (M), also called El; the hemagglutinin-esterase glycoprotein (HE); also called E3; 
and the nucleocapsid phosphoprotein (NT), to preferred embodiments, the antigens to be detected 
are the S, E and M proteins using antibodies to the same. 

15 Tbe present invention relates to kits for identifying individual SARSV and reagents used in 

such kits. The kits comprise a first container which contains antibodies which specifically bind to 
a SARSV antigen and a second container which contains the SARSV antigen. The antibodies are, 
preferably monoclonal antibodies. The kits may be adapted for quantifying the amount of 
antigen in a sample of an individual. Such information may be used in determining the prognosis 
20 of an infected indi^'idual. 

The present invention relates to methods of identifying individuals exposed to SARS virus, 
or biological samples containing SARSV, by detecting the presence of antibodies against SARS 
virus antig^ in a sample using SARS antigen. Quantification of the amount of anti-SARS 
protein from SARS antibodies present in a sample of an individual may be used in determining 
25 the prognosis of an infected individual. Any one or mote of the viral proteins (stractural proteins 
or nonstructural proteins) may be used as antigen to detect the SARSV antibodies; preferably a 
SARSV antigen that is consearved amoung SARSV isolates is preferred. In this regard, 
nonstructural protein (e.g., Pol, Hel, 3G4>, MP, PLPl, PLP2) may be particularly useful. 

The present invention relates to kits for identifying individuals exposed to SARS and 
30 reagents used therein. The kits comprise a first container which contains antibodies which were 
produced in response to exposure to an antigen fiom SARS virus and a second container which 
contains the SARS antigen(s). TTie kits may be adapted for quantifying the amount of anti-SARS 
antibodies present in a sample of an individual. Such information may be used in detmminmg the 
prognosis of an i nf ected individual. 

35 Tire present invention relates to methods of identifying individuals exposed to SARS virus, 

or biological samples containing SARSV, by detecting die presence of nucleic acid from SARS 
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virus. Quantification of the amount of S ARS nucleic acid present in a sample of an individual 
may be used in determining the prognosis of an infected individual. The methods utihze 
oligonucleotide probes and/or primers that are similar or complementary in sequence to the 
SARSV genome or transcription or replication products. Prefacred probes and prim^ are 
5 described haein. Also included in the present invention are kits for carrying out the methods of 
detecting the SARSV nucleic acid. 

The invention further includes a method for the treatment and/or prevention of SARS 
through the administration of a therapeutically effective amount of at least one antiviral 
compound from among those described in the US Patents and published international patent 
10 ^plications listed in Table 1 and Table 2. In one embodiment of the method, the antiviral 
compound is a small molecule. In another embodiment, the antiviral compound is a protease 
inhib itor, hi a further embodiment, the antiviral protease inhibitor is a 3C-like protease inhibitor 
and/or a papain-like protease inhibitor. In another embodiment, the antiviral compound is an 
inhib itor of an RNA-dependent RNA polymerase. In another embodiment, a first antiviral 
15 compound which is a protease inhibitor is admini stered with a second antiviral compound which 

is an RNA-dependent RNA polymerase inhibitor. The invention further provides for the 
adminis tration of a Steroidal anti -inflammat ory drug in combination with at least one antiviral 
compound, for example, from the antiviral compounds described in the documents listed in 
Table 1 and Table 2. 

20 Theinventionfurther provides for a method for the treatment and/or prevention of SARS 

through the administration of a therapeutically effective amount of at least one antiviral 
compound from among those described in the US Patoits and published international patent 
applications listed in Table 1 and Table 2 by inhalation. In one embodiment of the method, the 
antiviral compound is a smaU molecule, hi another embodiment, the antiviral compound is a 
25 protease inhibitor. In a further embodiment, the antiviral protease inhibitor is a 3C-like protease 
inhibitor and/or a papain-like protease inhibitor. In another embodiment, the antiviral compound 
is an inhibitor of an RNA dqiendent RNA polymerase, hi another embodiment, a first antiviral 
compound which is a protease inhibitor is ad mi nistered with a second antiviral compound which 
is an RNA-dq)«ident RNA polymerase inhibitor. The invention further provides for die 
30 administration of a steroidal anti-inflammatory drug in combination with at least one antiviral 
compound, for example, from the antiviral compounds described in the documents listed in 
Table 1 and Table 2 by inhalation. The steroidal anti-inflammatory drug may be administered by 
inhalation for a local effect or administered for systemic absorption such as via an oral or 
intravenous route. 

35 Hie invention further provides the use of an antiviral compound, as defined above, in the 

manufacture of a medicament for the treatment of severe acute respiratory syndrome. 
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Hie invCTtioii further provides for a kit for use by a consumer for the treatment and/or 
prevention of SARS. Such a kit comprises: (a) a pharmaceutical composition comprising a 
therapeutically effective amount of at least one antiviral compound from among those described 
in the US Patents and published international patent applications listed in Table 1 and Table 2 
5 and a pharmaceutically acceptable carrier, v^cle or diluoit; (b) a container for holding the 
pharmaceutical composition; and, optionally; (c) instructions describing a method of using the 
pharmaceutical compositions for the treatment and or the prevention of SARS. The kit may 
optionally contain a plurality of antiviral compounds for the treatment of SARS wherein the anti 
viral compoimds are selected from 3C-]ike protease inhibitors and papain-like protease 
10 inhibitors. Id a fiuther embodiment, the kit contains an antiviral compoimd which is an RNA- 
dependent RN A polymerase inhibitor. When the kit comprises more than one antiviral 
compound, the antiviral compounds contained in the kit may be optionally combined in the same 
pharmaceutical composition. 

An additional aspect of the invention provides for the use of at least one of the antiviral 
15 compounds described in the US Patents and published international patent applications listed in 
Table 1 and Table 2 for the manufacture of a medicament for the treatment or preveiition of 
SARS. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIGURE 1: Schematic of coronavirus genome organization. 

20 FIGURE 2: Schematic of coronavirus ORFla/ORFlb gene products. 

FIGURE 3 (A - C): Alignment of coronavirus polynucleotide sequences for selected genes 
(including nucleocapsid (N), matrix (M), and hemagluttinin-esterase (HE)). 

FIGURE 4 (A - F): Alignmmit of coronavirus polypq>tide sequences (including ORFla/ORFlb, 
nucleocapsid (NP), hemagluttinin-esterase (HE), envelope (Sm or E), matrix (M), and spike (S). 

25 FIGURE 5: Alignment of spike (S) polypeptide sequences, taken from Figure 4, in the region of 
the junction of the SI and the S2 domains, and protease cleavage site for selected coronaviruses. 

FIGURE 6: CDC phylogenetic tree of SARS-CoV strain (Qustalx 1.82, neighbor-joining tree). 
Hgure 6A shows coronavirus N protein analysis, Hgure 6B shows coronavirus S protein 
analysis, and Hgure 6C shows coronavirus M protein analysis. 

30 FIGURE 7: Conserved and specific sequence of the SARS virus. Hgures 7A-7D show multiple 
sequence alignments (CLUSTAL W 1.82) of tiie structural proteins of the SARS virus genome 
(7 A: PEP4 Spike protein; 7B: PEP7 small membrane protein; 7C: PEPS matrix glycoprotein; 7D: 
PEP13 nucleocapsid protein), which have counterparts in aU or some of the other known 
coronaviruses. Hgures 7E-7H show dendrograms reporting the protein distances among the 
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sequences in alignments 7A-7D. T-ahels 229E: human coionavirus; MEV: murine hepatitis virus; 
TGV: transmissible gastroenteritis virus; AIBV; avian infectious bronchitis virus; BOVINE: 
Bovine coronavirus; PEDV: porcine epidemic diarrhea virus. 

FIGURE 8: Alignment of the 5UTR of several coronaviruses, to show consensus nucleotide 
5 sequence at die 5TJTR. 

FIGURE 9: Sequences of preferred primers for amplification of the STITR. F and R denote 
forward and reverse PCR primers, ^d the numbers indicate nucleotide positions withing Figure 
8 . 

FIGURE 10; Alignment of the 3XJTR of several coionaAoruses, to show consensus nucleotide 
10 sequence at the SUTTR. 

FIGURE 11: Sequences of preferred primers for amplification of the 3UTR. F and R denote 
forward and reverse PCR primers, and numbers indicate nucleotide positions within Figure 10. 

FIGURE 12: Coiled-coil prediction for SEQ ID NO: 6042, using Coils program (Rguie 12A) or 
LearrCoil (Figure 12B). ' 

15 FIGURE 13; Example of insertion of a reporter gene-of-interest at a site between exisiting SARS i 
virus genes. Small nonstractural gene products are not depicted schematically. 

FIGURE 14: Schematic depicting representative examples of SARS virus replicons. Small 
nonstmctural gene products are not dqpicted schematically. 

FIGURE 15: SARS virus nsp2 protdnase (3CLp) and identification of catalytic and substrate 
20 sites. 

FIGURE 16: alignment of SARS virus nsp2 prot^nase (3CLp) with that of avian IBV, MHV, 
and BCoV. Residues in dotted boxes are key residues the substrate sites (F, Y & H); residues in 
solid boxes are catalytic cysteine (C) and histidine (H) residues. 

FIGURE 17: Genome organization of SARS coronavirus. Replicase and stractural regions are 
25 shown, along with the predicted products of cleavage within ORFla and ORFlb. The position of 
the 5’ RNA leader sequence ^), the 3’ poly(A) tract and the ribosonoal frame-shift consensus 
between ORFla and ORFlb are also indicated. Each box represent a protein product. They are. 
shaded according to the level of amino acid identity with corresponding proteins of other 
coronaviruses (see also Table 2). The SARS-specific genes are white. Positions of the 9 SARS- 
30 specific six-base IG sequences (5 -ACGAAC-3'; SEQ ID NO 7293) are indicated by arrows. 

FIGURE 18: Goiome organization of Coronaviruses representative of group 1 (HCoV-229E, 
accession number AF304460), group 2 (mouse hqpatitis virus MHV, accession number: 
NC_001846), group 3 (avian infectious bronchitis virus AIBV, accession number NC_001451) 
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and SARS coronavirus. Other completely sequenced coionaviiuses used in this study are 
available at die following accession numbers: porcine epidemic diarrhea virus (PEDV), 

AF353511; transmissible gastroenteritis virus (TGV), NC_002306; Bovine coronavirus (BCoV): 
AE220295. Red boxes represent group-specific genes. T^e position of the leader RNA sequence 
5 and poly(A) tract is also indicated in genomes where they are reported. The position of specific 
IG sequences is indicated by circles of different shades. In the SARS genome' we also find three 
IG sequences specific for group 2 coronavirus. . 

FIGURE 19: Topological model predicted for the spike protein anchored to the vital membrane. 
Structural and predicted functional domains are indicated. ITie N-terminal region (SI) is 
.0 predicted to contain the receptor binding domain. Two coiled coil regions within the S2 domain, 
partially superimposed to leucine zipper motifs are presumably involved in oligomerization. The 
hydrophobic domain is responsible for membrane anchoring. . . 

FIGURE 20: PhylogMietic tree obtained from the multiple sequence alignment of a 922 bp 
internal region of thepoZ gene from 12 coronaviruses and SARS. Numbers at the nodes represent 
15 the result of a bootstrap analysis and strongly support the branches. Sequences not available 
within the complete coronavirus genomes have been retrieved from GenBank at the following 
accession numbers: hemagglutinating micephalomyelitis virus of swine (PHEV), AF124988, 
Human OC43 virus (OC43), AF124989, canine coronavirus (CCV), AF124986, feline infectious 
peritonitis virus (FIPV), AF124987, turkey coronavirus (TCV), AF124991, syaloacryoadenitis 
20 virus of rats (SDAV),AF124990. 

FIGURE 21: 21A . Uiuxx)ted tree obtained from the alignment of consensus sequences of the 
group I and group II SI domain of spike proteins (Gl_cons and G2_cons) with those of a group 
3 spike (AIBV) and the spike of SARS virus. The number indicates the result of a bootstrap 
analysis. The sequences used to generate the consensus profile from group 1 are: HcoV-229E, 

25 accession number P15423; porcine epidemic diarrhea virus (PEDV), acc no: NP_598310; 

transmissible gastroenteritis virus (TGV), acc no: NP_058424; Canine coronavirus (CCV), acc 
no: S41453; porcine respiratory virus (PRV), acc no: S24284; feline infectious pentonitis virus 
(FIPV), acc no: VGIH79. TTie sequences used to generate the consensus profile fix>m group 2 
are: mouse hepatitis virus (MHV), acc no: NP_045300; Bovine coronavirus (BCoV), acc no: 

30 NP_150077; HnTnan coronavirus OC43, acc no: P36334; hemagglutinating encephalomyelitis 

virus of swine (PHEV), acc no: AAL80031; for group 3, only the sequence of the spike protein 
of avian infectious bronchitis virus (AIBV), acc no: AA034396 was used. 21B: Schematic 
representation of cysteine positions in SI domains of group 1. 2 and 3, compared to the SARS 
spike. Horizontal bars represent the SI amino acid sequences (in the case of SARS and AIBV) or 
35 the consensus profiles (generated fiom group 1, Gl_cons, and fiom group 2, G2_cons). The 
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length of the bars are not to scale. Relative cysteine positions are indicated by rectangle bars. 
Only cysteines perfectly conserved within each consensus are reported. Lines connect cysteines 
consCTved between die SARS SI domain and the consensus sequences as shown. 

FIGURE 22: illustration of a Neisseria Adhesin A protein (NadA). 

5 FIGURE 23: Raw translation from SARS coronavirus gmiome (reading frame +1). 

FIGURE 24: Raw translation from SARS coronavirus genome (reading frame +3) 

FIGURE 25: lb and Spike open reading frames, separated by * . 

FIGURE 26: SARS growth in veto cells. 

FIGURE 27: chromatogram of die capture step of SARS coronavirus on Matrix Cellufine 
10 Sulfate Superformance 150/10. Analysis was on 100ml coronoavirus harvest. TTie left Y axis 
shows absorbance at 280nm. The right Y axis shows the gradient (%B). The X axis shows the 
volume (ml). 

FIGURE 28: SUver-stained MCS chromatography fractions. Lanes are: (1) marker, 

(2) coronavirus vero cell harvest; (3) coronavirus veto cell harvest, after 0.65/im fiOitration; 

15 (4) flowthrough; (5) wash; (6) 20% peak (virus peak). Lanes were loaded with 1 /rg of test 

protein. 

FIGURE 29: Western Blot of MCS chromatography fractions. Lanes are as described for Fig.28. 

FIGURE 30: linear density gradient ultracentrifugation, 15-60% sucrose (SW28, 2 hours, 

20000 rpm). Tlie graph shows protein concentration (■) and sucrose concentration (♦). 

JO FIGURE 31 : Silver-stained density gradient fiactions on NuPage 4-12% Bis-Tris-Ge Q^ovex), 

reduced conditions, heated for 10 minutes at 70°C. Lanes are: (1) marker, (2) 20% peak MCS; 

(3) density gradient fraction 11; (4) density gradient fraction 12; (5) density gradient fraction 13; 
(6) density gradient fiaction 14; (7) density gradient fraction 15; (8) density gradient fraction 16; 
(9) density gradient fraction 17. Hie bulk of proteins was in fiactions 15 to 17. Lanes 2, 8 and 9 

15 were loaded with Ifig protein. 

FIGURE 32: Chromatogram of the C^ture Stq> of SARS coronavirus on MCS. Details are as 
for Rgure 27, except that 200ml harvest was used. 

FIGURE 33: Silverstain Oeft) and Western Blot (right) of chromatogr^hic fractions. Lanes are 
as described for Hgures 28 and 29, except that lane (6) is the 5% peak. Treatment before 
>0 SDS-PAGE was at room temperature for 30 minutes. 

FIGURE 34: Density Gradient Ultracentrifugation, 15-40% sucrose (SW28, 2 hours, 20000 
rpm). The graph shows protein concentration (■) and sucrose concentration (♦). 
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FIGURE 35: Silverstain Oeft) and Weston Blot (d^t) of Density Gradient Ultracentrifugation 
fractions on NuPage 4-12% Bis-Tris-Ge (Novex), reduced conditions. Lanes are: (1) marker, (2) 
density gradient fraction 6; (3) density gradient fraction 7; (4) density gradient fraction 8; (5) 
density gradient fraction 9; (6) density gradient fraction 10; (7) density gradient fraction 15. 

5 Fractions 7-10 (lanes 3-6) contained pure coronavirus proteins. The bulk of impurities was in 
fraction 15 (lane 7). Lanes 2, 8 and 9 were loaded with ~lfig protein. Treatment before 
SDS-PAGE was at room temperature for 30 minutes. 

FIGURE 36: EM pictures of Density Gradient Fractions 8-10. Figure 36A shows fraction 8; 
Rgure 36B shows fraction 9; Figure 36C shows fraction 10. 

10. FIGURE 37: Spike/NadA fusion constmcts. 

FIGURES 38 and 39: Results of the expression in E.coli of Sit, Slt-NadA and Slt-NadA^andior. 
Figure 38 shows SDS-PAGE analysis of total lysates from BL21(DE3)/pET, BL21(DE3)/pET- 
Slt and BL21(DE3)/pEI‘-Sli.-Nad AAai«-w . The bands are indicated by an arrow, and the three 

lanes are, from left to right- BL21(DE3)/pET; BL21(DE3 )/pET-S1l; BL21(DE3)/pEr- 

15 Slt-NadA^anchor. HguiB 39 shows (39A) SDS-PAGE and (39B) western blot analysed of total 
lysates from BL21(DE3)/pET, BL21(DE3)/pET-SlL-NadA (grown under un-induced condition) 
and BL21(DK)/pET-Slt-NadA (grown under induced condition). The bands are indicated by an 
arrow, and lanes are, from left to right BL21(DE3)/pET; BL21(DE3)/pET-SlL-NadA; 
BL21(DE3)/pBr-Slt-NadA. The western blot shows the presence of oligomeric forms of the 
iO protein. 

FIGURE 40: Schematic of SARS Spike clones. 

FIGURE 41: Transient Expression of SARS Spike Proteins (western blot of COS7 cell lysate). 
Each lane of the 4-20% TG SDS ^1 was loaded with 20^g cell lysate (total 1.2mg). The labeling 
antibodies are shown. 

5 FIGURE 42: Weston blot analyses of COS7 cell lysates on 4% TG SDS gel showing 

oligomerization state of intracellular S molecules. 

FIGURE 43: Western blot analyses of COS7 cell lysates on 4-20% TG SDS gel showing 
Transient Expression of SARS Spike Proteins. Lanes are: (1) mock, AF; (2) mock, DF; (3) nSh, 
AF; (4) nSh, DF; (5) nShATC, AF; (6) nShATC, DF. Each lane was loaded with 5/tl of each 
0 sample, 400/il total. The blot was labeled with antibody against the His-tagged protein. 

FIGURE 44: Western blot analyses of CXDS7 cell medium on 4-20% TG SDS gel showing 
Transient Expression of SARS Spike Proteins. Truncated spike protein is secreted. Spike 
jnoteins were purified from the culture medium (from a 10cm plate), first by a ConA column and 
then finally by EQsnag Magnetic beads. Each lane was loaded with one third of the material. 
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FIGURE 45: Western blot analyses of CX)S7 cell lysates on 4-20% TG SDS gel showing 
glycoslation of SARS spike proteins. In the two left-hand blots (lanes 1-S), samples were boiled 
in SDS and P-mracr^toethanol; in the two li^t-hand blots (lanes 6-1 1), samples were in SDS 
only, with no boiling. Lanes 1-8 were labeled widi a monoclonal raised against the His-tag 
5 protein; labes 9-1 1 were labeled with rabbit anti-SARS antibody. 

FIGURE 46: Effect of SARS spike protein expression on cell viability. 

FIGURE 47: Western blot analyses of COS7 cell lysates on 4% TG SDS gels showing 
oligomerization state of intracellular spike molecules. Blots were labeled with anti-His-tag mAb. 
The membrane firaction of COS7 cell lysate was firactionated by a sizing column before loading 
10 the lanes. Fractions 7 to 14 show bands with kDa values of: 71000, 1400, 898, 572, 365,232, 148 
and 99, respectively. 

FIGURE 48: Rnctionation of cells into aqueous and detra'gent fractions. 

FIGURE 49: Schematic of constructs for use in OMV preparation. 

HGURE 50: SARS HRl and HR2 constracts. 

15 FIGURE 51: Vaccine protection firoms SARS in Balb/c mouse model. 

FIGURE 52: Expressed on Spike protein in transfected 293 cell lysates (52A) or COS7 cell 
culture supernatants (52B). Proteins were separated on 4-20% TG SDS gels. The label was anti- 
Ifis-tag, except for the right-hand three lanes of 52B, where the label was rabbit anti-SARS 
sejxun. Li Rgure 52A, the left-hand three lanes were treated with DTT and were boiled, but 
20 neither treatment was used for the right-hand three lanes. In Figure 52B, no DTT was used, but 
all lanes were heated to 80°C for 5 minutes. 

FIGURE 53: Western blot of Spike proteins expressed in COS7 cells. Proteins were incubated at 
room temperature (RT), 80°C or 100°C to check for any effect on molecular weight FIGURE 54 
shows similar experiments on SARS virions. 

25 FIGURE 55: Results of a pulse chase experiment, showing expression and processing of SARS 

spike protein following infection with alphavirus r^licon particles. Cells were treated with or 
without EndoH as shown. 

FIGURE 56: Effect of heating on Spike protein trimers. 

FIGURE 57: Coomassie blue-stained ^1 of yeast-expressed proteins. Lanes are: 1-See Blue 
30 Standard (lOpl); 2-pAB24 gbl (20/ig^; 3-SARS Spike SI c.l gbl (20/ig); 4-SARS Spike SI c.2 
gbl (20/ig); 5-See Blue Standard (10/il); 6^AB24 ip (5/il); 7-SARS Spike SI c.l (5/il); 8-SARS 
Spike SI c.2 (5/il). 

FIGURES 58 to 64: Schematics of preparation of yeast expression constructs. 
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FIGURES 65 to 66: Yeast-expiessed sequences for Spike. 

FIGURE 67: Western blots showing expression of SARS spike protein from alphaviius replicon 
particles and replicon RNA. Rgore 67A was run undra* non-reducing conditions and at room 
temperature (Le. no heating), with lanes: (1) VEE/SIN-spike infection; (2) VEE/SIN-GFP 
5 infection; (3) Replicon-spike RNA transfection; (4) Replicon-GFP RNA transfection. Figure 67B 
was run with SARS virions at different temperatures, as shown. 

FIGURE 68: induction of antibody responses in mice. Vaccine groups are: (1) Inactivated SARS 
Virus; (2) Truncated Recombinant Spike Protein; (3) Full length Spike: DNA+DNA.PLG+ 
Alphavirus; (4) Full length Spike: Alphavirus particles only. 

10 FIGURE 69: Binding of human monoclonal antibody S3.2 to purified truncated Spike protein. 
The X-axis shows antibody concentration, and the Y-axis shows ELISA absorbance. The 
interpolation result is 2158.13. 

FIGURE 70: Geometric mean ELISA titers of antibodies induced by the SARS-CoV spike 
protein delivered as different vaccines Oeft to ri^t: inactivated virus; 3fig truncated spike 
15 protein; 75/rg DNA ^coding truncated spike protein. 

FIGURE 71: Neutralization titers after immunization with (left) nSdATC protein or (right) DNA 
encoding nSdATC, delivered on PLG. 

FIGURE 72: Correlation between the spike antigen binding and ne utralizin g antibodies 

FIGURE 73: Western blot of CHO cell lines expressing Spike protein in fiiU-length form Geft) or 
20 in truncated form (right). Proteins were separated by 4-12% SDS-PAGE, with boiling in DTT 
and staining by polyclonal serum. 

FIGURE 74: Stmctural comyponents of SARS-CoV spike glycoprotein and expression construct. 
L denotes leader peptide (residues 1-13), TM the transmembrane, and Cy the cytoplasmic tail 
segments. The hexa-His tags are not shown. 

25 FIGURE 75: Western blot analysis of SARS spike proteins expressed in COS7 cells. In Figure 
75A, COS7 cells were transfected with indicated plasrnid constructs and the expressed proteins 
in cell lysates 48 hr post-transfection were analysed by SDS-PAGE (4-20% polyacrylamide) in 
reducing and denaturing condtions, with proteins visualized by anti-histidine Mab. In Figure 
75B, proteins were collected from cell culture medium 48 hr post-transfection and purified first 
30 by a ConA column and then by His-tag magnetic beads. Purifed proteins were analysed by SDS- 
PAGE (4-20% polyacrylamide) and were visualized by anti-SARS rabbit serum. 
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FIGURE 76: ^do H sensitivity of C-terminal truncated spike protein (SA) found in cell lysate 
(lanes 1^) and culture mediiun (lanes 3,4). Positions of internal SA protein and secreted SA 
protdn are madced with arrow heads. 

FIGURE 77: Oligomeric status of the SARS spike protein. Recombinant S protein oligom^ in 
5 COS7 cells transfected with the full-length spike constmct (nSh). The cell lysates were treated 
with DTT and/or heat as indicated above each lane. The different forms of S protein in treated 
and untreated samples were visualized by SDS-PAGE (4% polyacrylamide) and Western blot 
analysis using anti-histidine MAb. 

FIGURE 78: Effect of heat denaturation on the oligomeric status of recombinant S protein in the 
10 absence of DTT. The COS7 cell lysates were heated before the electrophoresis as indicated and 
the S proteins were visualized as described fogjFigure 77. 

FIGURE 79: Effect of heat draaturation on the oligomeric status of spike protein in SARS virion 
particles. SARS-CoV were grown in Vero cells, purified and solubilized from the virion particles 
by SDS, heat-denatured as indicatedand visualized as described in Figure 77, except that rabbit 
15 antiserum against the purified virus was used as a probe. 

FIGURE 80: Analysis of the oligomeric status of SARS virion spike protein by cross-linking 
experiment. Solubilized SARS virion proteins were treated with DMS. Both untreated (-) and 
DMS treated (+) virion proteins were heat denatured in the absence of DTT and visualized by 
4% PAGE followed by silver staining. 

10 FIGURES 81 & 82: Analysis of the oligomeric status of truncated spike protein by heat 
denaturation. Truncated spike protein within COS7 cell lysates (81) or secreted into culture 
medium (82) were heat denatured as indicated in die absence of DTT and visualized by Western 
blot analysis. 

FIGURE 83: Reactivity of deglycosylated full-length spike oligomer with conformational and 
15 non-conformational antibody. The full-length recombinant spike oligomer was partially 
deglycosylated with PNGase F in non denaturating condition and visualized by Western blot 
analysis using anti-histidine Mab Oane 1,23) or rabbit antiserum against purified SARS CoV 
(lane 4,5,6). 

FIGURE 84: Localization of expressed SARS spike proteins in fractionated COS7 cell lysate 
0 visualized by western blot Cells were transfected with indicated plasmids and lysed with 

Dounce homogeniser in hypotonic buffer 48 hr post transfection. Cell lysate was centrifuged to 
obtain soluble cytosol and insoluble membrane fraction that was further solubfized by 4% Triton 
X-100. Proteins were heated with SDS at 80 C and analysed by SDS-PAGE (4-20% 
polyacrylamide) in reducing condtion. Proteins were visualized by anti-histidine Mab. The 
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cytosol fractions were loaded in lanes 1, 3, and 5 and the membrane fractions ware loaded in 
lanes 2, 4, and 6. 

FIGURE 85: Intracellular and surface expression of recombinant fuU-length (AJD) or truncated 
(B^) spike protein in COS7 cells. The cells were fixed at 48 hrs posttransfection and either 
5 treated with deter^nt (Cytofix/perm, BD Biosdences) for intracellular immunofluorescence 

(A3,Q or with 2% paraformaldehyde for cell surface immunofluorescence observation (D33) 
at x40 magnification. Mock transfected cells (C3) were included as controls. 

figures 86-105: SDS-PAGE od E.coli expressed proteins. Tot = total protein; Sol = soluble 
protein fraction. Labels are protein names (Tables 26-30). 

10 FIGURE 106: Immunofluorescence after administration of vector encoding optimsed N anti^n. 
FIGURE 107: Inununofluorescence of (A) native and (B) codon-optimsed M sequences. 

FIGURE 108: Immunofluorescence of (A) native and (B) codon optimsed E sequences. 

FIGURES 109-111: Western blots of Veto cells using rabbit antibodies obtained after . 
immunization with spike proteins expressed in E.coli. 

15 FIGURE 112: Spike protein expression in 293 ceUs. Lanes: (M) Markers; (1) Mock transfected; 
(2,6) cells expressing nS protein, lysate; (3,7) cells expressing nSdTC protein, lysate; (4,8) cells 
expressing nS protein, supernatant; (5,9) (4) cells expressing nSdTC protein, supernatant. 

Staining antibody: (2 to 5) mouse serum obtained after DNA immunization; (6 to 9) rabbit serum . 
obtained after immunization with whole killed virus. 

20 FIGURE 1 13: Six reading frames of SEQ ID NO: 9968. 

HGURE 114: Six reading frames of SEQ ID NO: 10033. 

FIGURE 115: Alignment of bovine coronavirus pol lab (top row; SEQ ID NO: 10068), avian 
infectious bronchitis pol lab (second row; SEQ ID NO: 10069), murine hepatitis virus pol lab 
(third row; SEQ ID NO: 10070), SEQ ID NO^: 9997/9998 (fourth row) and a consensus 
25 sequence (bottom row; SEQ ID NO: 10071). 

FIGURE 116: Schematic of coronavirus genome organization. 

FIGURE 1 17: Schematic of coronavirus ORFla/ORFlb gene products, including region. 

FIGURE 118: Alignment. 

FIGURE 119: Alternative start codons within SEQ ID NO: 1(K)80. 

30 HGURE 120: Six reading firames of SEQ ID NO: 10084. 

HGURE 121: Alignment of SEQ ID NO: 10033 and SEQ ID NO: 10084. 

HGURE 122: Reading frames in SEQ ID NO: 10084. 
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FIGURE 123: Start codon analysis.for SEQ ID NO: 10084. 

FIGURE 124: BLAST analysis of SEQ ID NO: 10210. 

HGURE 125: Epitope analysis of SEQ ID NO: 10210 by either (13A) Hopp & Woods or (13B) 
Kyte & Doolittle. 

5 FIGURE 126: Reading frames in SEQ ID NO: 10299. 

FIGURE 127: Reading frames in SEQ ID NO: 10505. 

FIGURE 128: Reading frames in SEQ ID NO: 11563. 

FIGURE 129: Reading frames in SEQ ID NO: 10033. 

HGURE 130: Alignment of SEQ ID NO: 9997 and SEQ ID NO: 10033. 

10 FIGURE 131: Reading frames in SEQ ID NO: 10299. 

HGURE 132: Reading frames in SEQ ID NO: 10505. . 

HGURE 133: Western Blot of SARS protease purification fractions. 

HGURE 134: Qeava^ of DABCYL-EDANS (a fluorescent tagged peptide with a SARS 
protease cleavage site) by SARS protease at different concentrations. The graph shows 
15 activity/concentration correlations with no protease (♦), 0.95 uM protease (d) and 2.86 uM 
protease (•). 

In the event of a discrepancy between a sequence in the sequence listing and a sequence 
in the drawings, the drawings should take precedence. 

DETAHvED DESCRIPTION OF THE INVENTION 
20 The practice of the present invention will employ, unless otherwise indicated, conventional 

methods of chemistry, biochemistry, molecular biology, immunology and pharmacology, within 
the skiU of the art. Such techniques are explained fuUy in the literature. See, e.g.. Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., 19th Edition (1995); Methods 
In Enzymology (S. Colowick and N. Kaplan, eds.. Academic Press, fric.); and Handbook of 
25 Experimental Immunology, Vols. I-IV (DM. Weir and C.C. Blackwell, eds., 1986, Blackwell 
Scientific Publications); Sambrook, et aL, Molecular Cloning: A Laboratory Manual (2nd 
Edition, 1989); Handbook of Surface and Colloidal Chemistry (Birdi, K.S. ed., CRC Press, 
1997); Short Protocols in Molecular Biology, 4th ed. (Ausubel et aL eds., 1999, John Wiley & 
Sons); Molecular Biology Techniques: An Intensive Laboratory Course, (Ream et aL, eds., 1998, 
30 Academic Press); PCR (Introduction to Biotechniques Series), 2nd ed. (Newton & Graham eds., 
1997, Springer Verlag); Peters and Dafrymple, Fields Virology (2d ed), Helds et aL (eds.), B 
Raven Press, New York, NY. 
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All publications, patoits and patent applications cited herein, are hereby incorporated by 
reference in their entireties. 

Severe Acute Respiratory Syndrome (SARS) virus has recently been identified as a new 
viral species. The SARS viral species includes the following isolates. 

- two virus isolates described in Peiiis et aL “Coronavirus as a possible cause of severe acute 
lespiiataty syndrome” Lancet published online at 

ht^://tmge.aielancetcom/extms/03ar6477web.p^ on April 8 2003, incorporated herein by 
reference in its entirety and the sequences deposited with GenBank at accession number 
AY268070. 

- the isolates and viral sequences described in Drosten et aL, ‘Identification of a Novel 
Coronavirus in Patients with Severe Acute Respiratory Syndrome”, New England Journal 
of Medicine, published o nlin e at ht^:/Avww.nejm.org on April 10, 2003. 

- the isolf»tp-«! and viral sequences described on the website of the WHO network on March 25 
and24,2003. 

- the isolates and viral sequences described in Tsang et al., ‘‘A Quster of Cases of Severe 
Acute Respiratory Syndrome in Hong Kong” New England Journal of Medicine, published 
online at http://www.nejm.org on March 31, 2003. 

- the isolates and viral sequences described in Poutanen et al., ‘Tdentification of Severe 
Acute Respiratory Syndrome in Canada” New England Journal of Medicine, published 
online at rt/iJj.’/'Wwv.nc/ni.org on March 31, 2003. 

As described in the Lancet article, a 646 base pair polynucleotide from the SARS virus has weak 
homology to viruses of the family Comoaviridae. TTie Lancet article further reports that a 

amino add sequence (of 215 amin o adds) fiom this sequence has about 57% sequence 
homology to tiie RNA polymerase of bovine coronavirus and murine hepatitis virus. 
Phylogenetic analysis of the protein sequences are also presented in the Lancet article showing 
that the polymerase sequence is most closely related to the group II coronaviruses. 

Additional SARS viral isolates can be identified, isolated and/or sequenced by virologists 
skilled in the art. Virologists can readily identify new viral isolates as a SARS virus. Cntena 
which a virologist may use to identify new SARS isolates include: sequence homology of the 
new isolate to known SARS viral isolates; similar genomic organization of the new viral isolate 
to known SARS viral isolates; immunological (serologic) similarity or identity with known 
SARS viral isolates; pathology; and similarity of virion morphology with known SARS viral 
isolates; and similarity of infected cell morphology as tiiat caused by known SARS viral isolates 
(visualized, for instance, by electron microscopy). 

hfethods for isolating and sequencing SARS viral isolates include die methods described 
by Peiris et ol. in tiie Lancet paper. As reported in the Lancet paper, RNA from clinical samples 
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can be reverse transcribed with random hexamers and cDNA can be amplified with primers 
having sequences of SEQ ID NOS: 6584 & 6585 in the presence of 2.5 mmol/L magnefrinm 
chloride (94°C for 1 min, 50°C for 1 min , and 72®C for 1 min). 

Reverse transcription of a viral isolate using random hexamers can be accomplished in an 
5 RT-PCR assay as follows. Virus isolates are propagated on mammalian cells, particularly fetal 
rhesus kidney cells. Total RNA from virus-infected and virus-uninfected fetal rhesus kidney 
cells is then isolated. RNA samples are reverse transcribed with a primer having SEQ ID NO: 
6586. cDNA can be amplified by a primer having SEQ ID NO: 6587. Unique PCR products (in 
size) in the infected cell preparation are then cloned and sequenced, and genetic homology of the 
10 sequence compared with those in GenBank. 

One skilled in die art would be able to identify and clone additional genomic regions using 
a variety of standard cloning techniques, such as, for example, using random primer RT-PCR 
and detection of sequences overl^iping one or more of the above sequences, and/or uring 
oligonucleotide primers, e.g., degenerate primers, based on the sequences provided herein (see 
15 Hguies 1-5, Rgures 8-11, SEQ ID NOS: 3-20). 

Qoning, sequencing and identification of SARS virus by one skilled in the art can be 
further facilitated by the use of polynucleotide sequences, particularly RNA polymerase 
sequences, from related Coronaviruses. 

Sequence homology of new viral isolates with the known SARS isolates described above 
20 can be readily determined by one skilled in the art. New SARS isolates may be identified by a 
percent homology of viral nucleotide sequences of 99%, 95%, 92%, 90%, 85%, or 80% 
homology of the new virus to known SARS viral polynucleotide sequences. New SARS isolates 
may also be identified by percent homology of 99%, 95%, 92%, 90%, 85%, or 80% homology of 
the polypeptides encoded by the polynucleotides of the new virus and the polypeptides encoded 
25 by known SARS virus. 

New SARS isolates may also be identified by a percent homology of 99%, 95%, 92%, 

90%, 85%, or 80% homology of tiie polynucleotide sequence for specific genomic regions for 
the new virus with the polynucleotide sequence for specific gnomic regions of the known SARS 
viruses. Additionally, new SARS isolates may be identified by a percent homology of 99%, 

10 95%, 92%, 90%, 85%, or 80% homology of the polypeptide sequence encoded by die 

polynucleotide of specific gmiomic regions of the new SARS virus to the polypeptide sequence 
encoded by the polynucleotides of specific regions of the known SARS virus. These genomic 
regions may include regions (e.g., ^ne products) which are typically in common among 
numerous coronaviruses, as well as group specific tenons (e.g., antigenic groups), such as, for 
»5 example, any one of the following genomic regions which could be readily identified by a 
virologist skilled in the art* S'untranslated region (UTR), leader sequence, ORFla, ORFlb, 
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nonstnictural protein 2 (NS2), hemagglutinin-estanse glycoprotein (HE) (also lefened to as E3), 
spike glycoprotein (S) (also refened to as E2), ORF3a, ORF3b, ORF3x, nonstnictural protein 4 
(NS4), envelope (smaU membrane) protein (E) (also referred to as sM), membrane glycoprotein 
(M) (also referred to as El), ORF5a, ORF5b, nucleocapsid phosphoprotein (N), ORF7a, ORE’Tb, 
5 intergenic sequences, 3’UTR, or RNA dependent RNA polymerase (pol). The SARS virus may 

have identifiable genomic regions with one or more the above-identified genomic regions. A 
SARS viral antigen includes a protein encoded by any one of these genomic regions. A SARS 
viral antigen may be a protein or a fragment thereof, which is highly conserved with 
coronavinises. A SARS viral antigen may be a protein or fragment thereof, which is specific to 
10 the SARS virus (as compared to known cornaviruses). (See, Figures 1-5, Hgures 8-1 1, SEQ ID 
NOS: 3-20). 

One skilled in the art could also recognize electron microscopy of a SARS virus infected 
mammalian cell. Electron microscopy of SARS infected cells are shown in the Lancet paper. 

As discussed in the paper, electron.microscopy of negative stained (3% potassium phospho- 
15 tungstate, pH 7.0) ultracentiifuged cell-culture extracts of SARS infected fetal rfiesus kidney 
cells show the presence of pleomorphic enveloped virus particles of around 80-90 nm (range 70- 
130 nm) in diameter with surface morphology compatible with a coronavirus (see Lancet paper. 
Figure 1). Thin-section electron microscopy of infected cells reveals virus particles of-55-90 nm' 
diameter within smooth walled vesicles in the cytoplasm (see Lancet paper. Figure 2B). 

20 Electron microscopy can also be used to observe virus particles at the cell surface. Electron 

microscopy of a human lung biopsy sample depicts similar viral morphology. See Lancet paper 
Figure 2A. 

7 . SARS POLYPEPTIDES AND POLYNUCLEOTIDES 

The invention relates to nucleic adds and proteins from SARS virus. Such polynucleotides 
25 and polypeptides are exemplified further below. 

In one embodiment, the polynucleotides of the inv^tion do not include one of the 
following five primers, disclosed at ht^://contenLngm.org/cgi/reprint/NEJMoa03078Iv2.pdf. SEQ ID 
NOS: 6034-38. 

The invention also includes polynucleotide sequences which can be used as probes for 
(0 diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
SEQ ID NOS: 21-1020. The invention further includes polynucleotide sequence comprising the 
complement of one or more of the prim^ sequences identified in SEQ ID NOS: 21-1020. 
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Hie invention includes a polyp^tide sequence comprising an amin o acid sequence fipom 
the sequence shown in Hgure 23. Such amino acid sequences are SEQ ID NOS: 6588-6809. The 
invention includes polypeptides comprising an amino acid sequence having sequence identity to 
these sequences, and the invention includes a fragment of a polyp^tide comprising one of these 
5 sequences. 

The invention includes a polypeptide comprising an amino acid sequence from the 
sequence shown in Figure 24. Such amin o add sequences are SEQ ID NOS: 6810-7179. The 
invention includes a protein comprising an amino add sequence having sequence identity to 
these sequences, and the invention includes a fragment of a protein comprising one of these 
10 sequences. 

The invention includes a protein comprising SEQ ID NO: 6039. The invention includes a 
polypeptide comprising an amino acid sequence having sequence identity to SEQ ID NO: 6039. 
The inv^tion includes a hagment of a polypeptide comprising SEQ ID NO: 6039. The 
invention includes a di^nostic kit comprising a polypeptide comprising SEQ ID NO: 6039, or a 
15 fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence- 
encoding SEQ ID NO: 6039, or a fragment thereof. The invention includes an immunogenic 
composition comprising a polypqptide comprising SEQ ID NO: 6039, or a fragment thereof. 

The invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 
6039, or a fragment thereof. SEQ ID NO: 6039 demonstrates functional homology with ORFla 
20 of coronaviruses. 

Predicted transmemhrane or hydrophobic regions of SEQ ID NO: 6039 are identified 
below. Although the polyprotein of coronaviruses is proteolytically cleaved into numerous 
smaller proteins, hydrophobic domains in the polyprotein are known to mediate the membrane 
association of the replication complex and to be able to dramatically alter the architecture of host 
25 cell membranes. Accordingly, the hydrophobic domains of the polyprotein are targets for 

genetic mutation to develop attenuated SARS virus vaccines. The hydrophobic domains are also 
targets for small molecule inhibitors of the SARS virus. The hydrophobic domains may also be 
used to generate antibodies specific to those regions to treat or prevent SARS virus infection. 

Predicted Transmenibraiie Helices in SEQ ID HO: 6039 

30 

The sequence positions in brackets denominate the core region. 

Only scores above 500 cure considered significant. 

Inside to outside helices : 43 found Outside to inside helices : 43 found 
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1071 


(1071)1089 


(1086) 


243 


1078 


776. 


( 776) 791 


( 791) 


1435 


783 


1121 


(1121)1137 


(1137) 


459 


1130 


1068 


(1071)1089 


(1086) 


370 


1078 


1679 


(1679)1696 


(1696) 


404 


1686 


1121 


(1121)1137 


(1137) 


455 


1130 


2098 


(2102)2119 


(2116) 


509 


2109 


1679 


(1679)1696 


(1694) 


340 


1686 


2145 


(2145)2160 


(2160) 


797 


2153 


2098 


(2098)2119 


(2116) 


678 


2109 


2206 


(2209)2224 


(2224) 


2686 


2216 


2148 


(2148)2163 


(2163) 


434 


2155 


2316 


(2316)2332 


(2332) 


2123 


2325 


2208 


(2210)2231 


(2226) 


2389 


2219 


2335 


(2339)2358 


(2354) 


2101 


2346 


2309 


(2309)2332 


(2326) 


1773 


2318 


2373 


(2373)2390 


(2390) 


532 


2380 


2342 


(2342)2368 


(2360) 


1666 


2353 


2597 


(2600)2615 


(2615) 


307 


2607 


2373 


(2373)2390 


(2390) 


254 


2380 


2753 


(2753)2770 


(2768) 


2242 


2760 


2753 


(2755)2770 


(2770) 


2119 


2763 


2831 


(2833)2854 


(2851) 


759 


2841 


2832 


(2835)2854 


(2851) 


687 


2844 


2879 


(2882)2900 


(2897) 


526 


2889 


2858 


(2858)2873 


(2873) 


253 


2866 


2990 


(2996)3012 


(3010) 


1289 


3003 


2879 


(2882)2899 


(2899) 


400 


2889 


3024 


(3024)3042 


(3039) 


2281 


3032 


2990 


(2990)3005 


(3005) 


875 


2998 


3054 


(3058)3075 


(3072) 


2536 


3065 


3020 


(3024)3042 


(3042) 


2795 


3032 


3105 


(3109)3127 


(3123) 
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3116 


3059 


(3059)3075 


(3075) 


2137 


3067 


3143 


(3143)3163 


(3159) 
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(3108)3127 


(3123) 
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3115 


3253 


(3255)3272 
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3343 


(3351)3366 


(3366) 
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485 


3897 


3710 


(3713)3738 
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(4035)4051 
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2367 
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4217 


(4217)4232 


(4232) 
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(3884)3902 
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4239 


(4239)4257 
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402 
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(4234)4254 


(4249) 


325 


4241 












4338 


(4341)4360 


(4360) 
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Accordingly, die invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6039, wherein said fragment comprises an amin o acid sequence including one or more of the 
hydrophobic transmembrane sequences identified above. The invention includes a polypeptide 
5 comprising a fragment of SEQ ID NO: 6039 wherein said fiagment comprise one or more of the 

following polypeptide sequences of SEQ ID NO: 6039: 473-488, 529-549, 584-606, 773-791, 
2098-2119, 2145-2160, 2206-2224, 2316-2332, 2335-2358, 2373-2390, 2753-2770, 2831-2854, 
2879-2900, 2990-3012, 3024-3042, 3054-3075, 3105-3127, 3438-3455, 3559-3584, 3589-3606, 
3611-3629, 3659-3674, 3756-3777, 473-488, 583-606, 776-791, 2098-2119, 2208-2231, 2309- 
10 2332, 2342-2368, 2753-2770, 2832-2854, 2990-3005, 3020-3042, 3059-3075, 3105-3127, 3142- 

3162, 3437-3453, 3560-3577, 3591-3606, 3610-3627, 3656-3678, 3710-3738, 3723-3738, and 
3760-3777. Preferably, the fragment comprises one or more of the following polypeptide 
sequences of SEQ ID NO: 6039: 2206-2224, 2316-2332, 2335-2358, 2753-2770, 3024-3042, 
3054-3075, 3105-3127, 3589-3606, 3611-3629, 3756-3777, 2208-2231, 2753-2770, 3020-3042, 
15 3059-3075, and 3591-3606. Preferably, the fragment comprises one or more of the following 
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10 



15 



20 



25 



30 



35 



polypeptide sequences of SEQ ID NO: 6039: 2206-2224 and 3020-3042. The invention also 
includes polynucleotides encoding each of the polypeptide fragments identified above. 

Ibe invention includes an attenuated SAKS virus wherein said attenuated SAKS virus 
contains an addition, deletion or substitution in the polynucleotides encoding for one of the 
hydrophobic domains identified above. The invention also includes a method for creating an 
attenuated SARS virus comprising mutating a SARS virus by adding, deleting or substituting the 
viral genome of the SARS virus to alter the coding of one or more of the hydrophobic domains 
of SEQ ID NO: 6039 identified above. 

Tlie invention includes an antibody which specifically identifies one or more of the 
hydrophobic regions of SEQ ID NO: 6039 identified above. The invention includes a small 
molecule which binds to, interferes with the hydrophobicity of or otb^wise disrupts one or more 
of tt»e hydrophobic regions of SEQ ID NO: 6039 identified above. 

Predicted N-glycosylation sites of SEQ ID NO: 6039 are identified in the chart below. 

Prediction o£ N-glycosylatioa sites in SEQ ID NO: 6039 

' Position Potential Jury NGlyc 

agreement result 

48 NGTC SEQ ID NO: 7180 0.6371 (7/9) + 

.389 NHSN SEQ ID NO: 7181 0.6132 (6/9) + 

916 NESS SEQ ID NO: 7182 0.5807 (7/9) + 

1628 NHTK SEQ ID NO: 7183 0.5610 (7/9) + 

1696 NKTV SEQ ID NO: 7184 0.5297 (5/9) + 

2031 NPTI SEQ ID NO: 9764 0.5299 (5/9) + WARNING: PRO- 

XI. 

2249 NSSN SEQ ID NO: 7185 0.6329 (9/9) ++ 

2459 NPTD SEQ ID NO: 9765 0.5599 (6/9) + WARNING: PRO- 

XI. 

2685 NVSL SEQ ID NO: 7186 0.6071 (8/9) + 

4233 NATE SEQ ID NO: 7187 0.6144 (7/9) + 

Accordingly, the inv^tion comprises a fiagment of SEQ ID NO: 6039 wherein said 
fragment comprises an amino acid sequence which includes one or more of the N-glycosylation 
sites identified above. Preferably, the Augment comprises one or more sequences selected from 
the group consisting of SEQ ID NOS: 7180-7187 & 9764-9765. Preferably, the fragment 
comprises the amino acid sequence NSSN (SEQ ID NO: 7185). 

The invention includes a polypqptide comprising a fragment of SEQ ID NO: 6039 wherein 
said fiagment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6039 are identified in Table 13. The invention includes a 
polypeptide for use as an antigen, whramn the polypeptide comprises: (a), an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified as SEQ ID 
NOS: 7400-7639; (b) an amino add sequence having sequence identity to an amin o add 
sequence of (a). The invention furtti^ comprising a polynucleotide sequence encoding the 
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polypeptides of (a) or Cj). The invention further comprising a method of expression or delivCTy 
of such polynucleotides througjb viral vectors and/or viral particles. The invention further 
conq>rises a polypeptide coursing two or more of the T-epitope sequences identified as SEQ 
ID NOS: 7400-7639, or a polynucleotide encoding such a polypeptide. 

5 The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 

complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raisiug a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. 

10 The invention provides the use of a pblypeptide in the manufacture of a medicament for 

immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mainTnal (typically a 
human), comprising the step of administering to the Tnammal a polypeptide as defined above, 

15 wherein said immune response is a ceU-medialed imnnmft response and, preferably, a CTL 
response. The immune response is preferably protective or ther^utic. 

The ORFla and ORFlb sequences of coronaviruses are typically translated as a single 
ORFlab polyprotein. Slippage of the ribosome during translation generates an a-1 frameshift. 
One region of such slippage is illustrated below: 

50 999ttttacacttagaaacacagtctgtaccgtctgcggaatgtggaaaggttatggctgtagttgtga 

+1 gftlrntvctvcgmwkgygcs cd 

+3GPYT-KHSLyRLRNV ERI.WL-L - 

ccaactccgcgaacccttgatgcagtctgcggatgcatcaacgtttttaaaogggtttgcggtgtaagt 

+1 QI*REPLMQSADASTFLNGFAV-V 
55 +3PTPRTLDAVCGC1NVFKRVCGVS 

gcagcccgtcttacaccgtgcggcacaggcactagtactg (SEQ ID NO: 7224) 

+1 QPVLHRAAQA LVL (SEQ ID NOS: 7225-7226) 
+3AARI.TPCGTGTST (SEQ ID NOS: 7227-7229) 

0 which would generate the following translational slippage (SEQ BD NOS : 7230-723 1): 

ccaactccgcgaacccttgatgcagtctgcggatgcatcaacgtttttaaacgggtttgcggtgtaagt 

QLRBPLMQSADASTFIiNRVCGVS 

Accordingly, the invention includes a polypeptide comprising SEQ ID NO: 7232. TTie 
5 invention includes a polypeptide cono^rising an amino add sequence having sequence identity to 
SEQ ID NO: 7232. The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
7232 Hie invention includes a diagnostic kit comprising a polypeptide comprising SEQ ED NO: 
7232 or a fiagment tiiereof. The invraition includes a diagnostic kit comprising a polynucleotide 
sequence encoding SEQ ID NO: 7232 or a fragment thereof. The invention includes an 
0 immunogenic composition comprising a polypeptide comprising SEQ ID NO: 7232 or a 
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fragment thereof. Hie invention includes an antibody which recognizes a polypeptide 
comprising SEQ ID NO: 7232 or a fragment tiiereof. 

The invention also includes a polypeptide comprising amino acid sequence X 1 -X 2 -X 3 , 
where Xi is SEQ ID NO: 7233, X 2 is from one to ten amino acids, and X 3 is SEQ ID NO: 7234. 

5 X 2 can comprise any sequence of one to ten amin o acids (SEQ ID NOS: 7235-7244) but, in 

preferred embodiments, X 2 is selected from the group consisting of F, FL, FLN, FLNR (SEQ ID 
NO: 7245), FLNRV (SEQ ID NO: 7246) and FLNRVC (SEQ ID NO: 7247). Preferably, X 2 is 
SEQ ID NO: 7247. These preferred embodiments are shown as SEQ ID NOS: 7248-7253. 

The invention includes a polypeptide comprising an amin o acid sequence having sequence 
10 identity to said amin o acid sequence X 1 -X 2 -X 3 . The invention includes a fragment of a 
polypeptide comprising said amin o acid sequences X 1 -X 2 -X 3 . Hie invention includes a 
diagnostic kit comprising a polypeptide comprising said amino acid sequences X 1 -X 2 -X 3 or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding said amin o acid sequences X 1 -X 2 -X 3 or a fragment thereof. The invention includes an - 
15 immunogenic composition comprising a polypeptide comprising said amino acid sequences 
X 1 -X 2 -X 3 or a fragment thereof. The invention includes an antibody which recognizes a 
polypeptide comprising said amin o acid sequ^ces X 1 -X 2 -X 3 or a fragment thereof. 

The amino acid sequences X 1 -X 2 -X 3 (Le. SEQ ID NOS: 7235-7244) demonstrate 
functional homology with the polyprotein of muri n e hepatitis virus. This polyprotein is cleaved 
20 to produce multiple proteins. Proteins which can be generated from the X 1 -X 2 -X 3 polyprotein, 
where X 2 is six amin o acids (SEQ ID NO: 7240) are listed below. 



Mouse virus protein 


Coordinates in Mouse vims 


Coordinates in SEQ ID NO: 7240 


Nsp2 


3334-3636 


3241-3546 


Nsp3 


3637-3923 


3547-3836 


Nsp4 


3924-4015 (or 4012) 


3837-3919 


Nsp5 


4016(cff4013>4209 


39204117 


Nsp6 


42104319 


41184230 


Nsp7 


43204456 


4231-4369 


Nsp9 


4457-5384 


4370-5301 


NsplO 


5385-5984 


5302-5902 


Nspll 


5985-6505 


5903-6429 


Nspl2 


6506-6879 


64306775 


Nspl3 


6880-7178 


6776-7073 



Hie invention includes a fragment of the amino acid sequence X 1 -X 2 -X 3 (Le. SEQ ID 
NOS: 7235-7244) wh«ein die fragment comprises one of the polypeptide sequ»ices identified in 
25 the above table. The invention further includes a fragment of the mnino acid sequence X 1 -X 2 -X 3 
wherein said fragment comprises a polypeptide sequence which has a serine at its N-terminus 
and a glutamine at its C-traninus. The invention further includes a fragment of the amino acid 

sequence Xi-X2-X3 wherein said fragment comprises a polypeptide sequence which has an 
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Alanine at its N-traminus and a glutamine at its C-tenninus. llie invention further includes a 
fragment of the amino acid sequence X 1 -X 2 -X 3 wherein said fragment comprises a polypeptide 
sequence which has a Asparagine at its N-terminus and a glutamine at its C-terminus. The 
invention furthCT includes a fragment of the amin o acid sequence Xi-X 2 -X 3 wherein said 
5 fragment comprises a Cysteine at its N-teiminus and a Glutamine at its C-t^minus. Each of the 
fragments identified above can be used in fusion proteins. 

The invention includes a diagnostic kit comprising a polypeptide comprising at least one of 
the fragments of the amino acid sequence X 1 -X 2 -X 3 (Le. SEQ ID NOS: 7235-7244) identified in 
the above paragraph. The invention includes a diagnostic kit comprising a polynucleotide 
10 sequence encoding at least one of the fragments of the ami no acid sequence X 1 -X 2 -X 3 identified 

in the above paragraph. The invention includes an immunogenic composition comprising a 
polypeptide comprising at least one of the fragments of the amino acid sequence X]-X 2 -X 3 
identified in the above paragraph. The invention includes an antibody which recognizes a 
polypeptide comprising at least one of the fragments of the amino acid sequence X 1 -X 2 -X 3 
15 identified in the above paragraph. 

Predicted N-glycosylation sites of the amin o acid sequence X 1 -X 2 -X 3 when X 2 is six amino 
acids are identified at the asparagines located at the following amino acid positions 48; 389; 556; 
916; 1628; 1696; 1899; 2079; 2249; 2252; 2507; 2685; 3303; 3373; 3382; 3720; 4150; 4233; 
4240; 5016; 5280; 5403; 5558; 5650; 5905; 6031; 6130; 6474; 6918; 6973. Accordingly, the 
20 invention comprises a fragment of SEQ ID NO: 7239 wherein said fragment is at least ten amino 
acids and whermn said fragment comprises one or more of the asparagines from the amino acid 
positions of SEQ ID NO: 7239 selected from the group consisting of 8 ; 389; 556; 916; 1628; 
1696; 1899; 2079; 2249; 2252; 2507; 2685; 3303; 3373; 3382; 3720; 4150; 4233; 4240; 5016; 
5280; 5403; 5558; 5650; 5905; 6031; 6130; 6474; 6918; and 6973. 

25 A zinc binding region 2 site within SEQ ID NOS: 7235-7244 is identified at amino acid 

residues 2102-21 12 (SEQ ID NO: 7254 HGIAAINSVPW). Ihe polypeptide of SEQ ID NOS: 
7235-7244 will be processed by the SARS virus into multiple peptides. This zinc binding region 
falls within the nspl region of the polypeptide. SEQ ID NO: 7254 is a target for screening of 
chemical inhibitors to the SARS virus, llie invention includes a polypeptide comprises SEQ ID 
30 NO: 7254. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ 
ID NO: 7254. The invention includes a method of screening SEQ ID NO: 7254 for an inhibitor. 
Hie invention includes the recombinant expression of SEQ ID NO: 7254 in a host cell. The 
invention includes a fragment of SEQ ID NOS: 7235-7244, wherein said fragment comprises 
SEQ ID NO: 7254. The invention includes a polypeptide comprising SEQ ID NO: 7254 wherein 
35 said polypeptide is complexed with a zinc ion. The invention includes a small molecule which 
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prevents a zinc ion from complexing with the polyp^tide of SEQ ID NO: 7254. Hie invention 
includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 7254. 

The polyprotein encoded by the SARS virus will contain at least two protease domains: a 
papain-like cystein protease (PIP) and a chymotrypsin-picomavirus 3C-like protease (SCl^)). 

5 (Hiere may be more than one copy of the PLP domain). These proteases function to cleave the 
polyprotein into multiple smaller proteins. The 3C-like protease, also known as the main 
protease” or h^ro, is itself cleaved from the polyprotein by its own autoprotease activity. See 
generally, Chapter 35 of Fields Virology (2nd ed), Kelds et al (eds.), B.N. Raven Press, New 
York, NY, and Anand et al., EMBO Journal (2002) 21 (13): 3213-3224. This 3CLp generally 
10 corresponds with the Nsp2 region identified above. 

The SARS virus 3CLp protein is furthra: characterized by SEQ ID NO: 6569 (also SEQ ID 
NO: 9769), as shown in FIGURE 15. 

FIGURE 16 also illustrates the SARS virus 3CLp, in allignment with the 3CLp of avian 
infectious bronchitis (IBV; SEQ ID NO: 6570), mouse hepatitis virus (MHV; SEQ ID NO: 

15 6571), and bovine coronavirus (BCoV; SEQ ID NO: 6572). Accordingly, the invention includes 

a polypeptide sequence comprising SEQ ED NO: 6569, or a fragment thereof, or a polypeptide 

sequence having sequence identity thereto. The invraition further includes a polynucleotide 
sequence encoding SEQ ID NO: 6569, or a fragment thereof. The invention includes a 
polynucleotide sequence encoding a polypeptide sequence having sequence identity to SEQ ID 
20 NO: 6569. 

The invention further includes a method of screening for an inhibitor of the SARS virus 
3CLp protein. In one embodiment, the invention includes a method of screening for an inhibitor 
of SEQ ID NO: 6569. The invention includes a method of recombinantly expressing the SARS 
virus 3CLp protein in a host cell. The invention includes a method of recombinantly expressing 
25 a polypeptide sequence comprising SEQ ID NO: 6569 or an enzymatically active fragment 
thereof or a polypeptide sequence having sequence identity thereto. The invention includes a 
small molecule which inhibits or reduces the proteolytic activity of the SARS virus 3CI^ 
protein. The invention includes a small molecule which inhibits or reduced the proteolytic 
activity of the polypeptide comprising SEQ ID NO: 6569. 

30 Catalytic residues of the SARS virus 3CI^ are identified in FIGURE 15 and 16. 

Specifically, a catalytic histidine and a catalytic cysteine are identified. Such catalytic sites are 
targp.tg for small molecules which could inhibit or reduce the protease activity of 3CLp. 
Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 6569, 
wherein said fragment comprises at least one catalytic site. Preferably, the catalytic site is 
35 selected from the group consisting of the indicated catalytic histidine and the catalytic cysteine in 

FIGURE 15 and 16. The invention includes a polynucleotide encoding a polypeptide, wherein 
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said polypeptide comprises a firagment of SEQ ID NO: 6569, wherein said fragment comprises at 
least one catalytic site. Preferably, the catalytic site is selected from the group consisting of tiie 
indicated catalytic histidine and the catalytic cysteine. 

Hie invention further includes a method of screening a compound library to identify a 
5 small molecule which inhbits a catalytic site of a SARS virus 3CLp. Preferably, the 30Lp 
comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence having sequence idratity 
thereto. The catalytic site is preferably selected from the group consisting of the indicated 
catalytic histidine and the catalytic cysteine in FIGURE 15 and 16. 

The invention includes a small molecule which inhibits the catalytic site of a SARS virus 
10 3CLp. Ptefeably, tiie 3CLp comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence 
having sequence identity thereto. The catalytic site is preferably selected from the group 
consisting of the indicated catalytic histidine and the catalytic cysteine in FIGURE 15 and 16. 

Residues of the substrate site of the SARS virus 3CLp are identified in FIGURE 15 aijd 16. 
Specifically, a substrate site is indicated at a phenylalanine, a tyrosine and a histidine. Such 
15 substrate sites are targets for small molecules which could inhibit or reduce the protease activity 
of 3C14>- Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO:. 6569, wherein said fragment comprises at least one substrate site. Preferably, the substrate 
site is selected from the group consisting of the indicated substrate phenylalanine, tyrosine and 
histidine in FIGURE 15 and 16. The invention includes a polynucleotide encoding a 
20 polypeptide, wherein said polypeptide comprises a fragment of SEQ ID NO: 6569, wherein said 
fragment comprises at least one substrate site. Preferably, the substrate site is selected from the 
group consisting of the indicated substrate phenylalanine, tyrosine and histidine in FIGURE 15 
and 16. 

The invention further includes a method of screening a compound library to identify a 
25 small molecule which blocks a substrate site of a SARS virus 3CLp. Preferably, the 

comprises SEQ ID NO: 6569, or a fragment thareof, or a sequence having sequence identity 
thereto. The substrate site is preferably selected from the group consisting of the indicated 
substrate phenylalanine, tyrosine and histidine in FIGURE 15 and 16. 

The invention includes a small molecule which inhibits the substrate site of a SARS virus 
30 3CLp. Preferably, the 3CLp comprises SEQ ID NO: 6569, or a fingment thereof, or a sequence 
having sequence identity thereto. The substrate site is preferably selected from the group 
consisting of the indicated substrate phenylalanine, tyrosine and histidine in FIGURE 15 and 16. 

The invention furthm: includes a diagnostic kit comprising a polynucleotide encoding a 
SARS virus 3CLp or a fragmmit thereof. Preferably, the SARS virus 3CLp comprising SEQ ID 
35 NO: 6569 or a fragment thereof or a polypeptide sequence having sequence identity thereto. 
Preferably, the fragment comprising one or more sites selected Ihom the group consisting of a 
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catalytic site and a substrate site. Preferably, the catalytic site is selected from the group 
consisting of one or more of the sites identified in HGURE 15 and 16. Preferably, the substrate 
site is selected from the group consisting of one or more of the sites identified in FIGURE 15 
and 16. 

5 The invention further comprises a diagnostic kit comprising an antibody specific to a 

SARS vims 3CLp or a fragment thereof. Preferably, the antibody is specific to the polypeptide 
comprising SEQ ID NO: 6569 or a fragment thereof or a polypeptide sequence having sequence 
identity thereto. Preferably, the antibody is specific to one or more sites of a SARS virus 3CLp 
selected from the group consisting of a catalytic site and a substrate site. Preferably, the catalytic 
10 site is selected from die group consisting of one or more of the sites identified in FIGURE 15 

and 16. Preferably, the substrate site is selected from the group consisting of one. or more of the 
sites identified in FIGURE 15 and 16. 

The invention includes a polypeptide comprising an amino acid sequence from the 
sequence shown in Rgure 25. Die two amino acid sequences within Figure 25, separated by a *, 
15 are SEQ ID NOS: 7188 & 7189. The invention includes a polypeptide comprising an amino acid 
sequence having sequence identity to the figure 25 translation. Die invention includes a 
fragment of a polypeptide comprising the figure 25 sequence. The invention includes a 
diagnostic kit comprising a polypeptide comprising the figure 25 translation, or a fragment 
thereof. Die invention includes a diagnostic kit comprising a polynucleotide sequence encoding • 
20 die figure 25 translation, or a fragment thereof. The invention includes an immunogenic 

composition comprising a polypeptide comprising the figure 25 translation, or a fragment 
thereof. The invention includes an antibody which recognizes a polypeptide comprising the 
figure 25 sequence, or a fragment thereof. The figure 25 sequence demonstrates functional 
homology with ORFlb of coronaviruses. 

25 SEQ ID NO: 7188 is an opai reading frame within Figure 25. The invention includes a 

polypeptide comprising SEQ ID NO: 7188. The invention includes a polypeptide comprising an 
amino acid sequence having sequence identity to SEQ ID NO: 7188. The invention includes a 
fragment of a polypqptide comprising SEQ ID NO: 7188. The invention includes a diagnostic 
kit comprising a polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. The 
30 invention includes a diagnostic kit comprising a polynucleotide sequence encoding SEQ ID NO: 
7188, or a fragment diCTeof. Die invention includes an immunogenic composition comprising a 
polj^tide comprising SEQ ID NO: 7188, or a fragment thereof. The invention includes an 
antibody which recognizes a polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. 

SEQ ID NO: 7190 is an open reading firame within SEQ ID NO: 7188. Die invention 
35 includes a polypeptide comprising SEQ ID NO: 7 190, a fragment thereof or a polypeptide 

having sequence identity tiiereto. Die invention further includes a polynucleotide encoding SEQ 
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ID NO: 7190, a fragment thereof or a polypeptide sequence having sequence identity thereto. 

An example of a polynucleotide encoding SEQ ID NO: 7190 is given as SEQ ID NO: 7191. 

SEQ ID NO: 7188 also contains an open reading frame comprising SEQ ID NO: 6042. 

The invention includes a polypeptide com prisin g SEQ ID NO: 6042. The invention includes a 
5 polypeptide comprising an amino add sequence having sequence identity to SEQ ID NO: 6042. 
llie invention includes a fragment of a polyp^tide comprising SEQ ID NO: 6042. The 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6042, or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding SEQ ID NO: 6042, or a fragment thereof. The invention includes an immunogenic 
10 composition comprising a polypeptide comprising SEQ ID NO: 6042, or a fragment thereof. 

The invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 
6042, or a fragment thereof. SEQ ID NO: 6042 demonstrates functional homology to a 
coronaviius spike protein. 

Predicted transmembrane re^oiis of SEQ ID NO: 6042 are identified below. 

15 Predicted. Transmembrane helices of SEQ XD KO; 6042 

The sequence positions in brackets denominate the core region. 

Only scores above 500 are considered significant. 

Inside to outside helices ; 18 found Outside to inside helices : 13 found 
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SEQ ID NO: 6042, the spike protein, is a surface exposed polypeptide. Recombinant 
expression of a protein can be hindered by hydrophobic transmembrane regions. Accordingly, 
the invKition includes a polypeptide comprising SEQ ID NO: 6042 wherein one or more of the 
hydrophobic regions identified above is removed. The invention further includes a 
25 polynucleotide encoding such a polypeptide. The invention includes recombinantly expressing 
the protein in a host ceU. Primers for amplifying the gene for spike protein and fingments 
thereof, such as fragments mcoding the soluble ectodomain, include SEQ ID NOS: 9753-9763 
QCiao et al. (2003) Biochem Biophys Res Comm 312:1159-1 164). 

FurthCT characterization of SEQ ID NO: 6042 is set forth below. 

FSOBT Prediction of Protein Loc 2 J.lzation Sites 
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version 6.4 (WWW) 

SEQ ID NO: 6042 - 1255 Residues 
Species classification: 4 

*** Reasoning Step: 1 

Preliminary Calculation of AIiOM (threshold: 0.5) 
count : 2 

Position of the most N-terminal TMS: 496 at i=2 
MTOP: membrane topology (Hartmann et al.) 

I(middle): 503 Charge diffirence(C-N) : 1-0 

McG: Examining signal sequence (McGeoch) 

Length of UR: 13 

Peak Value of UR: 3.28 

Net Charge of CR: 0 
Discriminant Score : 8.66 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 5.94 
' Possible cleavage site: 13 

»> to have a cleavable N-term signal seq» 

Amino Acid Composition of Predicted Mature Form: 
calculated from 14 

AIiOM new cnt: 1 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmembrane regions (Klein et al.) 
count: 1 value: -12.26 threshold: -2.0 

INTEGRAIi Likelihood =-12.26 Transmenibraiio 1202-1218 (1194-1228) 

PERIPHERAL Likelihood = 0.16 

modified ALOM score: 2.55 f 

»> Seems to be a Type la membrane protein 

The cytoplasmic tail is from 1219 to 1255 (37 Residues) 

Rule : vesiculcur pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 
(14) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 14 

Uncleavable? Ipos set to: 24- 
Discrimination of mitochondrial teurget seq. : 
positive (2.18) 

Rule: vesicular pathway 
Rule: vesiculeir pathway 
Rule: vesicular pathway 

*** Reasoning Step: 2 

KDEL Coxint : 0 

C!hec]cing apolar signal for intramitochondrial sorting 
((3avel position 24) from: 1 to: 10 Score: 8.0 

SKL motif (signal for peroxisomal protein) : 
pos: 964(1255), count: 1 SRL 
SKL score (peroxisome) : 0.1 

Amino Acid Composition Tendency for Peroxisome: 1.37 

AAC not from the N-term. , score modified 
Peroxisomal proteins? Status: notclr 
AAC score (peroxisome): 0.079 
Amino Acid Coniposition tendency for lysosomal proteins 
score: 0.39 Status: notclr 

GY motif in the tail of typela? (lysosomal) 

Checking the amount of Basic Residues (nucleus) 

Clhecking the 4 residue pattern for Nuclear Targeting 
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Checking the 7 residue pattern for Nuclear Teirgeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclectr Signal Status: negative ( 0.00) 

5 Type la is favored for plasma memb. proteins 
Checking the NPXY motif.. 

Checking the YXRF motif. . 

Checking N-myristoylation. . 

10 Final Results 

plasma mambrane — - Certainty^ 0.460 (Affizmative) < suco 

microbody (perosisome) Certainty^ 0.171 (Affizmative) < suco 

endoplasmic reticulum (membrane) Certainty= 0.100 (Affizmative) < suco 

endoplasmic reticulum (lumen) — — Certaintys 0 . 100 (Affizmative) < suco 

15 

SEQ ID NO: 6042 appears to have a N-terminus signaling region, followed by a surface 
exposed region, followed by a transmembrane region followed by a C-terminus cytoplasmic 
domain region. Accordingly, the invention includes an inmumogenic, surface exposed fragment 
of SEQ ID NO: 6042. Preferably, said fragment comprises an amino acid sequrace which does 
20 not include the last 50 amin o acids of the C-terminus of SEQ ID NO: 6042. Preferably, said 

fragment comprises an amin o acid sequence which does not include the last 70 amihb acids of 
the C-terminus of SEQ ID NO: 6042. Preferably, said fragment does not include a transdomain ■ 
region of SEQ ID NO: 6042. Preferably, said fragment does not include a C-terminus 
cytoplasmic domain of SEQ ID NO: 6042. Preferably, said fragment does not include a N- 
25 terminus signal sequence. Preferably, said fragment does not include amino acids 1-10 of the N- 
terminus of SEQ ID NO: 6042. Preferably, said fragment does not include amin o acids 1-14 of 
the N-terminus of SEQ ID NO: 6042. Two oligopeptide fragments of SEQ ID NO: 6042 that are 
able to elicit anti-spike antibodies are SEQ ID NOS: 7398 & 7399, as described (with additional 
C-terminus cysteines) by Xiao et ed. (2003) Biochem Biophys Res Comm 312:1159-1164. 

30 C-terminal truncations of spike protein, with removal of part of the cytoplasmic region, or 
removal upto and including the transmembrane region, are described by Yang et oL (2004) 

Nature 428:561-564. 

A variant of SEQ ID NO: 6042 that is included within the invention is SEQ ID NO: 9962. 
Compared to SEQ ID NO: 6042, this sequence has Ser at residue 581 instead of Ala, and has Phe 
35 at residue 1152 instead of Leu. 

The spike protein of coronaviiuses may be cleaved into two separate chains into SI and S2. 
The chains may remain associated together to form a dimer or a trimer. Accordingly, the 
invention includes a polypeptide comprising SEQ ID NO: 6042 wherein said polypeptide has 
been cleaved into SI and S2 domains. Hie invention fiuth^ includes a polypeptide comprising 
40 SEQ ID NO: 6042 wherein amino acids 1-10, preferably amino acids 1-14 of tiie N-terminus are 
removed and further wherein SEQ ID NO: 6042 is cleaved into SI and S2 domains. Preferably 
the polypeptide is in the form of a trimer. 
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The spike protein appears to form an alpha-helical structure in the transmembrane region- 
of the protein, preferably in tfie S2 domain. This alpha-helical structure is thought to associate 
with at least two additional spike proteins to form a trimer. Ifelical or coiled regions of the spike 
protein are identified below. Predicted coiled-coils of SEQ ID NO: 6042 (spike protein) are at 
5 amino acids 900-1005 and 1 151-1 185 (see Rgure 12). 

Accordingly, the invention comprises a polypeptide sequence comprising a fragment of 
SEQ ID NO: 6042 wherein said fragment includes a coiled region of SEQ ID NO: 6042. Said 
fragment preferably includes the amin o acid sequences selected from the group consisting of 
amino acid positions 900 to 1005 and amin o acid positions 1151 to 1185 of SEQ ID NO: 6042. 

10 The invention comprises a polypeptide sequence comprising a fragment of SEQ ID NO; 6042, 
wherein said fragment does not include a coiled region of SEQ ID NO: 6042. Said fragment 
preferably includes the amin o acid sequences selected from the group consisting of amin o acid 
positions 900 to 1005 and amin o acid positions 1151 and 1185 of SEQ ID NO: 6042. 

The spike protein is believed to play an integral role in fusion and infection of 
15 Coronavimses with mammalian host cells. Analysis of coronavims spike proteins as well as 
simil ar surface proteins in other viruses has identified at least two stractural motifs, typicaUy 
located within the S2 domain, associate with this fusion event: heptad repeats (HR) and 
membrane fusion peptides. 

At least two 4,3 hydrophobic heptad repeat (HR) domains are typically found in the 
20 ectodomain of the S2 domain of Coronavimses. One heptad repeat region (HRl) is typically 

located adjacent to a fusion peptide while a second heptad region (HR2) is typically located near 
the C-terminus of the S2 domain, close to the transmembrane anchor. Heptad repeats are 
characteristic of coiled-coil structures and the heptad repeats found in viral surface proteins (such 
as coronavims spike protein) are thought to form bundled helix stmctures which are involved in 
25 viral entry. See Bosch et aL,J. Virology (2003) 77:8801-881 1 (Rgure IB of this reference 
iUustrates an alignment of the HRl and HR2 regions of five coronavimses along with SARS, 
annotated “HCov-SARS”). 

Heptad repeats generally contain a repeating stmcture of seven amino acids, designated a- 
b-c-d-e-f-g, where hydrophobic sidechains of residues a and d typically form an apolar stripe, 

30 and electrostatic interactions are found in residues e and g. Position a is most frequently Leu, Be 
or Ala and position d is usuaUy Leu or Ala. Residues e and g are often Glu or Gin, with Arg and 
Lys also prominent at position g. Charged residues are c ommo n to positions b, c and/ as these 
residues may be in contact with solvent Exceptions to these general parameters are known. For 
instance Pro residues ate sometimes found within the heptad. 

35 The HRl and HR2 sequences of an MHV strain have been postulated to assemble into a 

thmmostable, oligomeric, alphahelical rold-like complex, with the HRl and HR2 helices 
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oriented in an antiparallel nianne r. Id. In this same study, HR2 was asserted to be a strong 
inhitibor of both virus entry into the cell and cell-cell fusion. 

HRl and HR2 sequences have been identified in the SARS virus genome. ITie SARS virus 
HRl region comprises approximately amin o acids 879 to 1005 of SEQ ID NO: 6042 or 
S fragments thereof cq>able of forming at least one alpha-helical turn. Preferably, said fiagments 

comprise at least 7 {e.g., at least 14, 21, 28, 35, 42, 49 or 56) amin o acid residues. SEQ ID NO: 
7192, includes amin o acids 879 to 1005 of SEQ ID NO: 6042. 

A preferred fragment of HRl comprises amino acid residues 879 to 980 of SEQ ID NO: 
6042. This preferred fragment is SEQ ID NO: 7193. 

10 AnothCT preferred fragment of HRl comprises amino acid residues 901 to 1005 of SEQ 

ID NO: 6042. This preferred fragment is SEQ ID NO: 7194. 

The SARS virus HR2 region comprises approximately amin o acids 1144 to 1201 of SEQ 
ID NO: 6042, or fragments thereof capable of forming at least one alpha-helical turn. Preferably, 
said fiagments comprise at least 7 (e.g., at least 14, 21, 28, 35, 42, 49 or 56) amino acid residues. 
15 SEQ ID NO: 7195 includes amino adds 1144 to 1201. A preferred fragment of HR2 comprises 
amin o acids 1144 to 1195 of SEQ ID NO: 6042. This preferred fragment is SEQ ID NO: 7196. 

Membrane Fusion peptides sequences within the spike protein are also believed to 
participate in fusion (and infection) of the virus with a host ceU. Fusion peptides generally 
comprise about 16 to 26 amino acid residues which are conserved within viral families. These 
20 Membrane Fusion pq)tides are relatively hydrophobic and generally show an asymmetric 

distribution of hydrophobitiy when modeled into an alpha helix. They are also generally rich in 
alanine and glycine. 

At least three hydrophobic Membrane Fusion peptide regions have been identified within 
coronaiviruses (PEPl, PEP2, and PEP3). See, Luo et aL, “Roles in CeU-CeU Fusion of Two 
25 Conserved Hydrophobic Regions in the Murine Coronavirus Spike Protein”, Virology (1998) 

244:483-494. Rgure 1 of this paper shows an alignment of Membrane Fusion peptide sequences 
of Mouse Hepatitis Viris, Bovine Corona Virus, Feline Infectious Peritonitis Virus, 
Transmissible Gastroenteritis Virus and Infectious Bronchitis Virus. See also, Bosch et al., “The 
Coronavirus Spike Protein is a Qass I Virus Fusion Protein: Stractural and Functional 
30 Characterization of the Fusion Core Complex” Journal of Virology (2003) 77(16):8801-8811. 

PEPl (SEQ ID NO: 7197), PEP2 (SEQ ID NO: 7198) and PEP3 (SEQ ID NO: 7199) 
sequences within the SARS spike protein have been identified. 

The coronavirus spike proteins (and other similar surface viral proteins) are thought to 
undergo a conformational change upon receptor binding to the tai^get ceU membrane. One or 
35 niore of the hydrophobic Membrane Fusion peptides are thougfit to become exposed and inserted 

into the target membrane as a result of this conformational change. The firee energy released 
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upon subsequent refolding of the spike protein to its most stable conformation is believed to play 
a role in the m^er of the viral and cellular membranes. 

One or more SARS HR sequences, preferably HR2, or a fragm^t thereof may be used to 
inhibit viral entry and membrane fusion with a target mammalian host cell. The invention 
5 provides a method of inhibiting viral infection comprising administering a composition 
comprising one or more SARS HR polypeptides or a fragment tfiereof. Preferably, the 
composition comprises a SARS HR2 sequence. 

In anoth^ embodiment, the invention includes a composition comprising a SARS HRl 
sequence, or a fragment thereof and a SARS HR2 sequence, or a fra gme nt thereof. Hie HRl and 
10 HR2 sequences may optionally be associated together in an oligomer. The composition may 
comprise the intermediate domain sequence between the HRl and HR2 domains. The use of 
such an intermediate sequence may facilitate oligomeru^on or other structural interaction 
between the HR regions. 

HR sequences for use in the invention may be produced recombinandy by methods known 
15 . in the art The SARS HR sequences may be modified to facilitate bacterial expression. In 

particular, the HR sequences may be modified to facilitate transport of the recombinant protein 
to the surface of the bacterial host cell. For example, leader sequences to a bacterial membrane 
prot^ may be added to the N terminus of the recombinant HR sequences. HR sequences for use 
in the invention may alternatively be produced by chemical synthesis by methods known in the 
20 art (see below). 

As discussed in more detail later in the specification. Applicants have identified structural 
similarities between the SARS spike protein and the surface protein of Neisseria meningitidis, 
NadA (and other similar bacterial adhesion proteins). Another means of facilitating bacterial 
expression of HR sequences includes the addition of the stalk and/or anchor sequences of a 
25 NadA-like protein to the C-terminus of the recombinant HR sequences. Recombinant sequences 
containing the bacterial anchor sequence may preferably be prepared in outer membrane vesicles 
(the preparation of which is discussed in more detail later in the application). Recombinant 
sequences missing the bacterial anchor sequences may be secreted and isolated from the 
supernatant 

30 Ihe invention includes a polypeptide sequence comprising a first sequence and a second 

sequence, wherein said first sequence comprises a leader sequence for a bacterial membrane 
protein and wherein said second sequence comprises a HR sequence of a coronavirus. 

Preferably, said first sequence comprises the leader sequence for a bacterial adhesin protein. 
More prefi^ably, said bacterial adhesion protmn is NadA. Preferably said second sequence 
35 comprises HRl, HR2 or both, hi one embodiment, die second sequence comprises HRl, HR2 
and the intermediate domain sequence present in the naturally occrding spike protein. For 
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example, the second sequence may comprise a fragment of a coronaviius spike protein 
comprising the amino acids starting with the N-terminus of the HRl region and ending with the 
C-tenninus of the HR2 region. 

The invention furth^ includes a polypeptide sequence comprising a first, second, third and 
5 fourth sequence, wherein the first sequence comprises a leader sequence for a bacterial 

membrane protein; wherein said second sequence comprises a BDR. sequence of a coronaviius; 
wherein said third sequence comprises a stalk domain of a bacterial adhesion protein; and 
wherein said fourth sequence comprises an anchor domain of a bacterial adhesion protein, fri 
one embodiment, the first sequence comprising the leader peptide ^uence is removed. In 
10 another embodiment, the third sequmice comprising the stalk domain is removed. In another 
embodiment, tire fourth sequence comprising the anchor domain is removed. 

The polypeptide sequences of the above described constructs may be linked together by 
means known in the art, including, for example, via glycine linkers. 

' Examples of constructs which niay be used, in such bacterial expression systems are showm 
15 in FIGURE 50. Polypeptide sequences of each of the constructs illustrated in FIGURE 50 are 
given as SEQ ID NOS: 7200 to 7206.- 

7200 LeaderNadA(l-29)-HRl (879-980)-6XgIy-HR2(1144-1195)-sialk+anchorNadA(88-405) 

7201 LeaderNadA(l-29)-HRl (879-980)-6Xgly-HR2(1144-1196)-stalkNadA(88-351) 

7202 Leader NadA (1-29)- HRl -HR2 (879-1 196) -stalk+anchorNadA (88-405) 

7203 Leader NadA (1-29) - HRl - HR2 (879-1 196)-stalk NadA (88-351) 

7204 HRl -HR2 (879-1 196)-stalk NadA (88-35 l)-6xhis 

7205 Leader NadA (1-29) - HRl - HR2 (879-1 196)-anclK)r NadA (351-405) 

7206 LeadCT NadA (1-29) - HRl - HR2 (879-1 196) 

Adimnistration of one of more of these Membrane Fusion sequences may also interfere 
with the ability of a coronaviius to fu^ to a host cell membrane. Accordingly, the invfention 
20 includes an isolated polypq^itide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 7197, SEQ ID NO: 7198and SEQ ID NO: 7199. The invention 
further includes an isolated polypeptide comprising an amino acid sequence having sequence 
homology to an amino acid sequence selected from the group-consisting of SEQ ID NO: 7197, 
SEQ ID NO: 7198and SEQ ID NO: 7199. 

J5 Two or more of these S ARS Membrane Fusion peptides can be combined together. The 

invention includes a composition comprising two SARS Membrane Fusion peptides wherein said 
peptides are selected from at least two of the amino adds selected from the group consisting of 
SEQ ID NO: 7197, SEQ ID NO: 7198and SEQ ID NO: 7199, or a sequence having sequence 
identity thraeto. 

to Two or more of die SARS Membrane Fusion peptides may be linked together. 

Accordingly, the invention includes a polypeptide comprising a first amino add sequence and a 
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second amino add sequence, wherein said first and second amino add sequences are selected 
from the group consisting of SEQ K) NO: 7197, SEQ ID NO: 7198and SEQ ID NO; 7199, or a 
sequence having sequence identity thereto. Preforably, said first amino acid sequence and said 
second amino add sequence are different SAKS Membrane Fusion peptides, Le., they are not the 
5 same. 

Ihe invention dso includes a method of treating or preventing SARS virus infection 
comprising administering one or more of the SARS Membrane Fusion peptide compositions 
described above. 

As discussed above, the spike protein is capable of forming tiimers. The invention further 
10 includes a poly^ptide comprising SEQ ID NO: 6042 in bimeric form. The invention includes a 
composition comprising at least polypeptides wherein each polypeptide comprises at least the 
alpha-helical coiled region of a SARS virus spike protein. Preferably, the spike protein 
comprises SEQ ID NO: 6042 or a fragment thereof. 

The invention further includes a composition com prising a SARS virus spike protein or a ’ 
15 fragment thereof wherein said protein is assodated with a transmembrane and wherein said 

fragment comprises the alpha-helical region of the SARS virus spike proteiiL Preferably, the 
composition comprises at least three SARS virus spike proteins or a fra gm ent thereof, wherein 
the fragment comprises the alpha-heUcal region of the SARS virus spike protein. 

Ihe invention further includes an antibody which spedfically binds to a trimaic form of 
20 SARS virus spike proteins. Preferably, the spike protein comprises SEQ ID NO: 6042 or a 

fragment thereof. The invention includes an antibody which spedficaUy binds to a tiimeric form - 
of SARS virus spike proteins wherein said proteins are associated with a transmembrane. 

The invention further includes an antibody which specifically binds to a monomeric form 
of SARS virus spike protein or a fragment thereof. Preferably, the antibody specifically binds to 
25 a monomeric form of SEQ ID NO: 6042 or a fragment thereof. 

The invention further includes a small molecule which interferes with or dismpts the 
coiling of a SARS viral spike protein trimer. 

The invention further includes an attenuated SARS virus for use as a vaccine wherein said 
attenuated virus contains a polynucleotide insertion, deletion or substitution which does not 
JO disrupt the trimeric conformation of the SARS virus spike protein. The invention further 

includes an attenuated SARS virus for use as a vaccine wherein said attenuated virus contains a 
polynucleotide insertion, deletion or substitution which does not disrupt the alpha-helical 
formation of the SARS virus spike protein. 

The spike protein may be recomibinantly produced. Li one embodim^t, the spike protein 
J5 is expressed in virus like particles so that the protein is attached to a cell membrane. Such 

attachment may facilitate presentation of immunogenic ^itopes of the spike protein. Preferably, 
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the alpha-helical portion of the spike protein is associated with the cell membrane. Preferably, 
the spike proteins form a trimer within the transmembrane region of attachme nt 
Predicted N-gJycosylation sites of SEQ ID NO: 6042 are identified below: 

Position 

5 

29 NYTQ 
65 NVTG 
109 NKSQ 
119 NSTN 
10 158 NCTF 

227 NITN 
269 NGTI 
318 NITN 
330 NATK 
15 357 NSTF 

589 NASS 
602 NCTD 
699 NFSI 
783 NFSQ 
20 1080 NGTS 

1116 NNTV 
1176 NESL 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
25 6042 wherein said ftagment comprises one or more of the glycosylation sites identified above 

(SEQ ID NOS: 7207-7223). The invention further includes a polynucleotide encoding one or 
more of the fragments identified above. This glycosylation site can be covalently attached to a 
saccharide. Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO: 6042 wherein said fragment comprises one or more of the glycosylation sites identified 
10 above and wherein said polypeptide is glycosylated at one or more of the sites identified above. 
Predicted O-glycosylation sites are identified below: 



Residue No. Potential Threshold Assignment 



Thr 


698 


0.8922 


0.7696 


T 


Thr 


706 


0.9598 


0.7870 


T 


Thr 


922 


0.9141 


0.7338 


T 


Ser 


36 


0.8906 


0.7264 


S 


Ser 


703 


0.8412 


0.7676 


S 



The invention inclu^ a polypeptide comprising a fragment of SEQ ID NO: 6042 wherein 
0 said fragment comprises one or more of the O-glycosylation sites identified above. The 
invention further includes a polynucleotide encoding one or more of the fragments identified 
above. The invention further includes a polypeptide comprising a fragment of SEQ ID NO: 

6042 wherein said fragment courses one or more of the O-glycosylation sites identified above 
and further wh^in the polypeptide is covalently bonded to a saccharide at one or more of the 
5 included glycosylation sites. 

Hie invention further includes a polypeptide comprising a fragment of SEQ ID NO: 6042 
wherein said fragmrait comprises one or more of the N-glycosylation sites identified above and 











Potential 


Jury 


NGlyc 












agreement 


resiilt 


SEQ 


ID 


NO: 


7207 


0.7751 


(9/9) 


+++ 


SEQ 


ID 


NO: 


7208 


0.8090 


(9/9) 


+++ 


SEQ 


ID 


NO: 


7209 


0.6081 


(7/9) 


+ 


SEQ 


ID 


NO: 


7210 


0.7039 


(9/9) 


++ 


SEQ 


ID 


NO: 


7211 


0.5808 


(7/9) 


+ 


SEQ 


ID NO: 


7212 


0.7518 


(9/9) 


+++ 


SEQ 


ID 


NO: 


7213 


0.6910 


(9/9) 




SEQ 


ID 


NO: 


7214 


0.6414 


(9/9) 


++ 


SEQ 


ID 


NO: 


7215 


0.6063 


(8/9) 


+ 


SEQ 


ID 


NO: 


7216 


0.5746 


(8/9) 


+ 


SEQ 


ID 


NO: 


7217 


0.5778 


(6/9) 


+ 


SEQ 


ID 


NO: 


7218 


0.6882 


(9/9) 


+-I- 


SEQ 


ID 


NO: 


7219 


0.5357 


(7/9) 


+ 


SEQ 


ID 


NO: 


7220 


0.6348 


(9/9) 


++ 


SEQ 


ID 


NO: 


7221 


0.5806 


(7/9) 


+ 


SEQ 


ID 


NO: 


7222 


0.5106 


(5/9) 


+ 


SEQ 


ID 


NO:. 


7223 


0.6796 


(9/9) 


++ 
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furtiier whCTein said fiagment compnses one or more of the 0-glycosylation sites identified 
above. 

Ihe invention includes a polypeptide conopiising a fiagment of SEQ ID NO: 6042 wherein 
said fiagment does not include one or more of the glycosylation sites identified above. Tlie 
5 invention also includes a polynucleotide encoding such a polypeptide. 

Predicted phosphorylation sites of SEQ ID NO: 6042 are Ser-346, Tyr-195, and Tyr-723. 
Accordingly, the invention comprises a polypeptide comprising a fiagment of SEQ ID NO: 6042 
wherein said fiagment comprises at least ten amin o acid residues and wherein said fiagment 
comprises one or more of the amino acids selected fiom the group consisting of Ser-346, Tyr- 
10 195, and Tyr-723. In one embodiment, one or more of the amino acids selected from the group 

consisting of Ser-346, Tyr-195, and Tyr-723 arc phosphorylated. 

Expression and functional characterization of the spike glycoprotein has been described by 
Xiao et al. (2003) Biochem Biophys Res Comm 312:1159-1164. 

T-epitopes for SEQ ID NO: 6042 are identified in Table 16. The invention includes a 
15 polypeptide for use as an anti^n, wherein the polypeptide comprises: (a) an amino add 

sequence selected fioiri the group consisting of the T-epitope sequences identified as SEQ ID 
NOS: 8041-8280; (b) an amino acid sequence having sequence identity to an amino add 
sequence of (a). The invention further comprising a polynucleotide sequence encoding' the 
polypeptides of (a) or (b). The invention furth»- comprising a method of expression or delivery 
20 of such polynucleotides throu^ viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8041-8280, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein {e.g. a class I HLA) and a fiagment of said antigen; (3) 
25 as an antigen for raising a ceU-mediated imm une response; and/or (4) as an anrigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

30 The invention provides a method of raising an imm une response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immu ne response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6040. The invention 
35 includes a polypeptide comprising an amin o acid sequence having sequence identity to SEQ ID 
NO: 6040. The invention includes a fiagment of a polypeptide comprising SEQ ID NO: 6040. 
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Hie invention includes a polynucleotide encoding SEQ ID NO: 6040 or a fragment thereof. The 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6040 or a 
fragment thereof. Hie invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding SEQ ED NO: 6040 or a fragment thraeof. Tlie invention includes an antibody which 
5 recognizes a polypeptide comprising SEQ ID NO: 6040 or a fragment thereof. 

SEQ ID NO: 6040 demonstrates functional homology with a membrane protein of 
coronaviruses. Predicted transmembrane helices of SEQ ID NO: 6040 are identified below: 

Predicted Transneinbraiie Helices 

10 The sequence positions in brackets denominate the core region . 

Only scores above 500 are considered significant. 

Inside to outside helices : 3 found 

from to score center 

15 27 ( 30) 48 { 45) 1138 38 

137 ( 139) 153 ( 153) 486 146 

Outside to inside helices : 3 found 







from 






to 


score 


center 


20 


28 


(• 31) 


45 


( 


45) 


819 


38 




71 


( 73) 


90 


( 


90) 


210 


81 




136 


( 142) 


156 


( 


156) 


272 


149 



Hie amino acid region with the hipest predicted transmembrane helical region is from 
25 amino acid position 27 to 48 of SEQ ID NO: 6040. Such transmembrane regions are often 

difficult to express recombinantly. Accordingly, the invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6040 wherein said fragment does not include the amino 
acid sequence between positions 27 to 48. Hie invention includes a polypeptide comprising a 
fragment of SEQ ID NO: 6040 wherein said fragment does not include the amin o acid sequence 
30 between positions 28 to 45. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypqptides. 

SEQ ID NO; 6040 is predicted to be a hypothetical protein of the SARS virus. A 
prediction of the protein localization of SEQ ID NO: 6040 is set forth below. SEQ ID NO: 6040 
is predicted to be located in one of the following locations: mitochondrial matrix space, 

35 microbody (peroxisome), nucleus, and mitochondrial iimer membrane. SEQ ID NO: 6040 is 
predicted to be associated with an organelle inside an infected cell. 

Accordingly, SEQ ID NO: 6040 is a target for scre^ng of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6040 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
40 NO: 6040 or a fragment thereof. The invention includes a method of screerung SEQ ID NO: 

6040 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6040 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
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ID NO: 6040 firom associating with an organelle inside of an infected cell. Hie invention 
includes a fiision protein wherein said fusion protein comprises SEQ ID NO: 6040. 

PSORT Prediction o£ Protein Localization Sites 

version 6 . 4 (WWW) 

5 SEQ ID NO: 6040 163 Residues 

Species classification: 4 

*** Reasoning Step: 1 

10 Preliminary Calculation of ALOM {threshold: 0.5) 

count : 0 

McG: Examining signal sequence (Mc(3eoch) 

Length of UR: 9 

Pealc Value of UR: 1.75 

15 Net Charge of CR: 1 

Discriminant Score: -2.56 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 1.94 
Possible cleavage site: 53 
20 »> Seems to have no N- terminal signal seq. 

Amino Acid Composition of Predicted Mature Form: 
calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
25 ALOM: finding transmembrane regions (Klein et al.) 

count: 0 value: 1.32 threshold: -2.0 

PERIPHERAL Likelihood =1.32 
modified ALOM score: -1.16 

Gavel: Examining the boundary of mitochondrial targeting seq. 

30 motif at: 156 

HRSVTI 

Discrimination of mitochondrial target seq. : 
notclr ( 0.88) 

Rule: mitochondrial protein 
35 Rule: mitochondrial protein 
Rule: mitochondrial protein 
Rule: mitochondrial protein 

*** Reasoning Step: 2 

40 

KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 156) from: 27 to: 44 Score: 5.0 

Mitochondrial matrix? Score: 0.36 

45 SKL motif (signal for peroxisomal protein) : 
pos: 99(163), co\mt: 1 SKL 
SKL score (peroxisome): 0.3 

Amino Acid Composition Tendency for Peroxisome: -4.28 

Peroxisomal proteins? Status: notclr 
50 Amino Acid Composition tendency for lysosomal proteins 
score: 0.02 Status: notclr 

Modified score for lysosome: 0.152 
Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
55 Found: pos: 132 (5) KRKR 

(flecking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 
nuc modified. Score: 0.60 

60 Nuclear Signal Statois: notclr ( 0.30) 
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Checking CaaX motif.. . 

Checking N-myristoylation. . 

Chec)cing CaaX motif. , 

5 Final Results 

mitochondrial matrix space Certainty^ 0.480 (Affixmative) < suco 

microbody (peroxisome) Certainty^ 0 .300(AffizmativB) < suco 

nucleus Certainty^ 0.300 (Affirmative) < suco 

mitoc h on d rial inner membrane Certainty^ 0.18 8 (Affirmative) < suco 

10 

Predicted N-glycosylation sites of SEQ ID NO: 6040 are identified below. 

Position Potential Jury NGlyc 

agreement result 

15 2 NKTG (SEQ ID NO: 7255) 0.7804 (9/9) +++ 

106 NLTL (SEQ ID NO: 7256) 0.6123 (7/9) + 

Accordingly, the invention comprises a fragment of SEQ ID NO: 6040 wherein said 
fragment is at least ten amino acids and wherein said fragment comprises one or more of the 
20 asparagines from the amino add positions of SEQ ID NO: 6040 selected fiom the group 

consisting of 2 and 106. Ibe invention includes a fragment of SEQ ED NO: 6040 wherein said 
fragment comprises one or more amino add sequences selected from the group consisting of 
SEQ ID NO: 7255 and SEQ ID NO: 7256. Preferably, the fragment comprises the aminn acid 
sequence NKTG (SEQ ID NO: 7255). 

25 The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6040 wherein 

said fragment does not include one or more of the ^ycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6040 are identified in Table 14. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
30 sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 7640-7800; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
35 comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 7640-7800, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein {e.g. a class I HLA) and a fragment of said antigen; (3) 
as an anti^n for raising a ceU-mediated immune response; and/or (4) as an antigen for raising a 
40 CTL response. The use preferably protects or treats disease and/or infection caused by a SAKS 

virus. 
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Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immonising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

Hie invention provides a method of raising an immune response in a Tnammal (typically a 
5 human), comprising the step of administering to the mammal a polypeptide as defined above, 

wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. Hie immu ne response is preferably protective or therapeutic. 

Hie invention includes a polypeptide comprising SEQ ID NO; 6041. SEQ ID NO; 6041 
demonstrates functional homology with a portion of an ORF lab polyprotein. Hie invention 
10 includes a polypeptide comprising an amin o acid sequence having sequence identity to SEQ ID 
NO; 6041. Hie invention includes a fragment of a polypeptide comprising SEQ ID NO; 6041. 
Hie invention includes a polynucleotide sequence encoding an amin o acid sequence having 
sequence identity to SEQ ID NO; 6041. Hie invention includes a polynucleotide encoding a 
fragment of a polypeptide comprising SEQ ID NO: 6041 . 

15 Hie invention includes a diagnostic kit comprising a polypeptide com prisin g SEQ ID NO: 

6041 or a fragment therof. Hie invention includes a diagnostic kit comprising a polynucleotide 
encoding SEQ ID NO; 6041 or a fragment thereof. Hie invention includes an immunogenic 
composition comprising a polypeptide comprising SEQ ID NO: 6041 or a fingment thereof. Hie 
invention includes an antibody which recognizes a polypeptide com prising SEQ ID NO: 6041 or 
20 a jfragment thereof. 

Hie polyproteins of coronaviruses are associated with enzymatic activity. Accordingly, 
SEQ ID NO: 6041 is a target for screening of chemical inhibitors to the SARS virus. Hie 
invention includes a polypeptide comprising SEQ ID NO: 6041 or a fragment thereof. Hie 
invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID NO: 6041 or 
25 a fragment thereof. Hie invention includes a method of screening SEQ ID NO: 6041 for an 

inhibitor. Hie invention includes the recombinant expression of SEQ ID NO: 6041 in a host cell. . 
Hie invention includes a small molecule which prevents the polypeptide of SEQ ID NO: 6041 
from performing enzymative activity. Hie invention includes a fusion protein wherein said 
fusion protein comprises SEQ ID NO: 6041. 

30 Predicted transmembrane or hydrophobic regions of SEQ ID NO: 6041 are identified 

below. Although the polyprotein of coronaviruses is proteolytically cleaved into numerous 
smaller proteins, hydrophobic domains in the polyprotein are known to mediate the membrane 
association of the replication complex and to be able to dramaticaUy alter the architecture of host 
cell membranes. Accordingly, the hydrophobic domains of the polyprotein are tarots for 
35 genetic mutation to develop attenuated SARS virus vaccines. Hie hydrophobic domains are also 
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10 



15 



20 



25 



30 



35 



40 



45 



targets for small molecule inhibitors of the SARS virus. Hie hydrophobic do mains may also be 
used to genra:ate antibodies specific to those regions to treat or prevent SARS virus infection. 

Possible transmembrane helices o£ SEQ ID HO: 6041 

The sequence positions in brackets denominate the core region. 

Only scores above 500 are considered significcuit. 



Inside to outside helices 



18 found 





from 


to 


score 


center 


234 


{ 234) 254 


( 250) 


1046 


241 


256 


{ 256) 272 


( 270) 


252 


263 


319 


( 319) 334 


( 334) 


227 


327 


503 


( 505) 522 


( 519) 


405 


512 


613 


( 615) 633 


( 629) 


619 


622 


677 


( 679) 703 


( 696) 


467 


689 


849 


( 851) 869 


( 865) 


229 


858 


1080 


(1080)1097 


(1094) 


306 


1087 


1147 


(1149)1163 


(1163) 


354 


1156 


1557 


(1557)1581 


(1577) 


817 


1567 


1954 


(1954)1971 


(1971) 


832 


1964 


2369 


(2372)2395 


(2387) 


300 


2379 


2513 


(2513)2532 


(2529) 


690 


2522 


Outside to inside l^elices : 14 found 




from 


to 


score 


center 


239 


( 239) 254 


( 254) 


924 


247 


239 


( 248)‘ 272 


( 263) 


468 


256 


311 


( 314) 334 


( 328) 


267 


321 


499 


( 503) 522 


( 519) 


485 


512 


617 


( 617) 634 


( 631) 


425 


624 


849 


( 853) 872 


( 872) 


572 


864 


1147 


(1147)1162 


(1162) 


765 


1155 


1564 


(1564)1581 


(1579) 


883 


1572 


1951 


(1951)1968 


(1966) 


657 


1958 


2513 


(2522)2539 


(2537) 


711 


2529 


Accordingly, the invention includes a poly] 



6041, wh^in said fingment comprises an amino add sequence including one or more of the 
hydrophobic transmembrane sequences identified above. The invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6041 wherein said fragment comprises one or more of the 
following polypeptide sequences of SEQ ID NO: 6041: 234-254, 613-633, 1557-1581, 1954- 
1971, 2513-2532, 239-254, 1564-1581, 1951-1968, 2513-2539. Preferably, the fiagment 
comprises one or more of the following polypeptide sequences of SEQ ID NO: 6041: 234-254 
and 239-254. The invention also includes polynucleotides encoding each of the polypeptide 
fragments id»itified above. 

The invention includes an attenuated SARS virus wherein said attenuated SARS virus 
contains an addition, deletion or substitution in the polynucleotides encoding for one of the 
hydrophobic domains identified above. Hie invention also includes a method for creating an 
attenuated SARS virus comprising mutating a SARS virus by adding, deleting or substituting the 
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vital genome of the SARS virus to alter the coding of one or more of the hydrophobic domains 
of SEQ BD NO; 6041 identified above. 

The invention includes an antibody which specifically identifies one or more of the 
hydrophobic regions of SEQ ID NO: 6041 identified above. 'Die invention includes a small 
5 molecule which binds to, interferes with the hydrophobidty of or otherwise disrapts one or mote 

of the hydrophobic regions of SEQ ID NO: 6041 identified above. 

Predicted N-glycosylation sites of SEQ ID NO: 6041 are identified below: 

Position Potential Jury NGlyc 

agreement result 



571 


NLSH 


(SEQ 


ID 


NO: 


7257) 


0.6598 


(8/9) 


+ 


835 


NTSR 


(SEQ 


ID 


NO: 


7258) 


0.5762 


(7/9) 


+ 


958 


NVTD' 


(SEQ 


ID 


NO: 


7259) 


0.7494 


(9/9) 


+ + 


1113 


NISD 


(SEQ 


ID 


NO: 


.7260) 


0,7259 


(8/9) 


+ 


1205 


NSTL 


(SEQ 


ID 


NO: 


7261) 


0.6296 


(9/9) 


+ + 


1460 


NVTG 


(SEQ 


ID 


NO: 


7262) 


0.6844 


(9/9) 


++ 


1685 


NHSV 


(SEQ 


ID 


.NO: 


7263) 


0.5181 


(5/9) 


+ ■ 


2029 


NKTT 


(SEQ 


ID 


NO: 


7264) 


0.5423 


(5/9) 


+ 



Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
20 acid sequence of SEQ ID NO: 604i; wherein said fragment comprises one or more of the N- 
glycosylation sites identified above. The invention comprises a polypeptide comprising a 
fragment of SEQ ID NO: 6041 wherein said fragment comprises one or more of sequences SEQ 
ID NOS: 7257-7264. Preferably, the fragment comprises one or more of the sequences SEQ ID ■ 
NOS: 7257, 7259, 7260, 7261 and 7262. The invention further comprises a polynucleotide 
25 encoding one or more of the polypeptides identified above. 

The invention includes a polypq>tide comprising a fragment of SEQ ID NO: 6041 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6041 are identified in Table 15. Tbe invention includes a 
30 polypeptide for use as an antigen, wherein the polypeptide comprises; (a) an amin o acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 7801-8040; (b) an ami no add sequence having sequence identity to an amin o add 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). Die invmition further comprising a method of expression or delivery 
35 of such polynucleotides throug}i viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 7801-8040, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
40 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
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CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. 



The invention provides the use of a polypqptide in the manufacture of a medicament for 
imnninising a mammal (typically a human) against SARS viral infection whraein the polypeptide 
S is as defined above. 

Hie invention provides a method of raising an immune response in a m ammal (typically a 

human), comprising the step of administering to the manDonal a polypeptide as defined above, 

wherein said immu ne response is a cell-mediated immune response and, preferably, a CTL 

response. The immun e response is preferably protective or therapeutic. 

10 The invention includes a polypeptide sequence SEQ ID NO: 6043 or a fragment thereof. 

The invention includes a polypeptide comprising an amino acid sequence having sequence 

identity to SEQ ID NO: 6043. The invention includes a polynucleotide sequence encoding the 

amino acid sequence of SEQ ID NO: 6043 or a firagment thereof. 

Predicted transmembrane regions of SEQ ID NO: 6043 are set forth below. 

15 Inside to outside helices : 4 found 

from to score center 

41 ( 41) 56 ( 56) 1789 49 

76 ( 79) 99 ( 99) 2142 89 

105 { 105) 125 { 125) 1250 115 

20 

Outside to inside helices : • 3 found 

from to score center 

41 ( 41) 59 ( 56) 2053 49 

76 ( 82) 98 ( 96) 1580 89 

25 103 ( 105) 125 { 123) 1257 115 

The amin o ^lcid region with the highest predicted transmembrane helical region is fix>m 
amino acid position 76 to 99 of SEQ ID NO: 6043. Such transmembrane regions are often 
difficult to express recombinantly. Accordingly, the invention includes a polypeptide 
30 comprising a fragment of SEQ ID NO: 6043 wherein said fragment does not include one or more 
of the hydrophobic amino acid sequences identified above. Preferably, the fragment does not 
include the ami no acids between positions 27 to 48. The invention also includes a 
polynucleotide sequence encoding any of the above-identified polypeptides. 

SEQ ID NO: 6043 is predicted to be a hypothetical protein of the SARS virus. A 
35 prediction of the protein localization of SEQ ID NO: 6043 is set forth below. SEQ ID NO: 6043 
is predicted to be located in one of the following locations: mitochondrial inner membrane, 
plasma membrane, Golgi body, and mitochondrial intermembrane space. SEQ ID NO: 6043 
may be associated with an organelle inside an infected cell. 

Accordingly, SEQ ID NO: 6043 is a target for screening of chemical inhibitors to the 
40 SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6043 or a fi-agment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
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NO: 6043 or a fragment thereof. Tlie invention includes a method of screening SEQ ID NO: 
6043 for an inhibitor. Die invention includes the recombinant expression of SEQ ID NO: 6043 
in a host cell. Die invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6043 from associating with an organelle inside of an infected cell. Die invention 
5 includes a fusion protein wherein said fusion protein corc^rises SEQ ID NO: 6043. 

PSORT Prediction of Protein Loccilization Sites for SEQ ID NO: 6043 

version 6. 4 (WWW) 

Species classification: 4 
10 *** Reasoning Step: 1 



Preliminary Calculation of ALOM (threshold: 0.5) 
count ; 3 

Position of the most N- terminal TMS: 40 at i=2 
15 MTOP: membrane topology (Hartmann et al.) 

X (middle) : 47 Charge diffirence(C-N) : 3.5 

McG: Examining signal sequence (McGeoch) 

Length of UR: 12 

Peak Value of UR: 1.41 

20 Net Charge of CR: 0 

Discriminant Score -4.67 
GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 3.44 
Possible cleavage site: 15 
25 »> Seems to have no N-terminal signal seq. 

Amino Acid Composition of Predicted Mature Form: 
calculated from 1 

ALOM new cnt: 2 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
30 ALOM: finding transmembrane regions (Klein et al.) 

count: 2 value: -6.90 threshold: -2.0 

INTEGRAL Likelihood = -6.90 Transmembrane 83 - 99 ( 78 - 101) 

INTEGRAL Likelihood = -5.04 Transmembrane 40 - 56 ( 37 - 60) 

PERIPHERAL Likelihood = -0.32 ^ 

35 modified ALOM score: 1.48 

»> Likely a Type Illb membrane protein (Nexo Ccyt) 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 128 
MRCWLC 

40 Discrimination of mitochondrial target seq. : 

notcir ( 0.76) 

Rule: mitochondrial protein 
Rule: mitochondrial protein 
Rule: mitochondrial protein 
45 Rule: mitochondrial protein 

*** Reasoning Step: 2 

Type Ilia or Illb is favored for ER memb. proteins 
50 KDEL Count : 0 

Checlcing apolar signal for intramitochondrial sorting 
(Gavel position 128) from: 39 to: 56 Score: 11.5 
»> Seems to have an intramitochondrial signal 
Mitochondrial inner membrane? Score: 0.59 

55 Mitochondrial intermemb. space? Score: 0.22 

SKL motif (signal for peroxisomal protein) : 
pos: 92(274), count: 1 SHL 
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SKL score (peroxisome); 0.3 
Amino Acid Composition Tendency for Peroxisome: 4.78 

Peroxisomal proteins? Status: j>ositive 
Amino Acid Composition tendency for lysosomal proteins 
5 score: 1.16 Status: notclr 

Type III proteins may be localized at Golgi 
Checking the amount of Basic Residues (nucleus) 

Qiecking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for Nuclear Targeting 
10 Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding i3X)tif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Check the Number of TMSs for typelll (plasma memb.) 

Checking N-myristoylation. . 

15 

Final Results 

mitochondrial inner meinbrane Certainty^ 0.664 (Affirmative) < suco 

plasma membrane Certainty^ 0.600 (Affirmative) < suco 

Golgi body Certainty® 0.400 (Affirmative) < suco 

20 mitochondrial intermembrane space Certainty® 0*362 (Affirmative) < suco 



Predicted N- and O- glycosylation sites of SEQ ID NO: 6043 are identified below. 

Position Potential Jury NGlyc 

agreement result 

25 227 NATF (SEQ ID NO: 7265) 0.6328 (7/9) + 

Residue No. Potential Threshold Assignment 



Thr 


28 


0.9095 


0.6280 


T 


Thr 


32 


0.8740 


0.6595 


T 


Thr 


34 


0.9058 


0.6655 


T 


Thr 


170 


0.6816 


0.6600 


T 


Thr 


267 


0.9240 


0.5779 


T 


Thr 


268 


0.7313 


0.5708 


T 


Thr 


269 


0.9859 


0.5583 


T 


Thr 


270 


0.8023 


0.5492 


T 


Ser 


27 


0.6930 


0.6091 


S 


Ser 


252 


0.6457 


0.5977 


S 



Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
40 acid sequence of SEQ ID NO: 6043, wherein said fragment comprises the N-glycosylation sites 
or 0-glycosylation sites identified above. The invention comprises a polypeptide comprising a 
fragment of SEQ ID NO: 6043 wherein said fragment comprises one or more of the N- 
glycosylation sites or O-glycosylation sites identified above. The invention further comprises a 
polynucleotide encoding one or more of the polypeptides identified above. 

45 The invention includes a polypeptide comprising a fiagment of SEQ ID NO: 6043 wh^ein 

said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6043 are identified in Table 17. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
50 sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 

NOS: 8281-8486; (b) an amino acid sequrace having sequence identity to an amino acid 
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sequCTce of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invration further comprising a method of expression or delivery 
of such polynucleotides throu^ viral vectors and/or viral particles. The invraition further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
5 ID NOS; 8281-8486, or a polynucleotide encoding such a polypeptide. 

Hie use as an antigen is preferably a use: (1) as a T-cell anti^n; (2) for g enftr ating a 
complex between a class I MHC protein <e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
10 virus. Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an imm une response in a mammal (typically a 
human), comprising the st^ of administoing to the mammal a polypeptide as defined above, 

15 wherein said immune response is a ceU-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6044. The invention 
includes a polypeptide comprising a fegment of SEQ ID NO: 6044 or a sequence having 
sequence idaitity to SEQ ID NO;206. The invention includes a polynucleotide encoding SEQ 
20 ID NO: 6044. 

SEQ ID NO: 6044 is identified as a hypothetical protein. Predicted hydrophobic or 
transmembrane regions of SEQ ID NO: 6044 are identified below: 

Inside to outside helices : 3 fouad 

from to score center 

25 1 ( 1) 17 ( 15) 891 8 

47 ( 47) 66 ( 63) 221 56 

Outside to inside helices : 4 found 

from to score center 

30 1 ( 4) 21 ( 19) 599 11 

Accordingly, the invention includes a polypeptide comprising a firagment of SEQ ID NO: 
6044 wherein said firagment does not include one or more of the hydrophobic amino acid 
sequences identified above. Preferably, the fragment does not include the amino adds between 
35 positions 1 to 19. The invention also includes a polynucleotide sequence encoding any of the 
above-identified polypeptides. 

SEQ ID NO: 6044 is predicted to be a hypothetical protein of SARS virus. A prediction of 
the protein localization of SEQ ID NO: 6044 is set forth below. SEQ ID NO: 6044 is predicted 
to be located in one of the following locations; nucleus, mitochondrial matrix, lysosome (lumen), 
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and microbody (peroxisome). SEQ ID NO: 6044 may be associated with an organelle inside an 
infected cell. 

Accordingly, SEQ ID NO: 6044 is a target for screening of chemical inhibitors to the 
SARS virus. Hie invention includes a polypeptide comprises SEQ ED NO: 6044 or a fragment 
5 thereof. Hie invention includes a polynucleotide encoding die polypeptide sequence of SEQ ED 
NO: 6044 or a fragment thereof. Hie inveition includes a method of screening SEQ ED NO: 
6044 for an inhibitor. Hie invention includes the recombinant expression of SEQ ED NO: 6044 
in a host ceU. Hie invention includes a small molecule which prevents the polypeptide of SEQ 
ED NO: 6044 from associating with an organelle inside of an infected cell. Hie invention 

10 includes a fusion protein wherein said fusion protein comprises SEQ ED NO: 6044. 

PSORV Prediction o£ Protein Localization Sites for SEQ ID NO: 6044 

version 6 . 4 (WWW) 

154 Residues 

Species classification: 4 

15 

*** Reasoning Step: 1 _ 

Preliminary Calculation of ALOM (threshold: 0.5) 
coimt : 0 

20 McG: Examining signal sequence (McGeoch) 

Length of UR: 7 

Peak Value of UR: 1.06 

Net Charge of CR: 1 . 

Discriminant Score: -7.97 

25 GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): -3.28 
Possible cleavage site: 34 
»> Seems to have no N-terminal signal seq. 

Amino Acid Composition of Predicted Matxire Form: 

30 calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmenibrane regions (Klein et al.) 
count: 0 value: 1.43 threshold: -2.0 

35 PERIPHERAL Likelihood = 1.43 

modified ALOM score: -1.19 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 151 
FRKKQV 

40 Discrimination of mitochondrial teirget seq.: 

notclr (-0.46) 

*** Reasoning Step: 2 

45 KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 151) from: 46 to: 50 Score: 5.0 

Mitochondrial matrix? Score: 0.36 

SKL motif (signal for peroxisomal protein) : 

50 pos: -1(154), count: 0 

Amino Acid Composition Tendency for Peroxisome: 0.61 

Peroxisomal proteins? Status: notclr 
AAC score (peroxisome) : 0.149 
Amino Acid Composition tendency for lysosomal proteins 
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score: 0.81 Status: notclr 

Modified score for lysosome: 0.231 
Checking the cimoiint of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
5 Found: pos: 134 (3) KHKK 

Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Found: pos: 136 (3) KK VSTNLCTHSF RKKQV 

Final Robbins Score (nucleus) : 0.60 

10 Checking the RNA binding motif (nucleus or cytoplasm) 
nuc modified. Score: 0.90 

Nuclear Signal Status: positive ( 0.70) 

Chec)d.ng CaaX motif. . 

Checlcing N-myristoylation. . 

IS Checlcing CaaX motif. . 

Final Results 

nucleus --- Certainty= 0.880 ( Affirmative) < suco 

mitochondrial matrix space — - Certainty^ 0.360 (Affirmative) < suco 

20 lysosoone (lumen) Certainty^ 0.231(Affirmative) < suco 

microbo^F (i>eroxisoiiie) Certainty^ 0.149 (Affirmative) < suco 

One predicted O-glycosylation site of SEQ E) NO: 6044 is identified at residue 4: 

Residue No. Potential Threshold Assignment 
25 Thr 4 0.6839 0.6484 T 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
add sequence of SEQ ID NO: 6044, wherein said fragment comprises the O-gJycosylation site 
identified above. The invention further comprises a polynucleotide encoding one or more of the 
30 polypeptides identified above. 

The invention includes a polypeptide comprising a fingment of SEQ ID NO: 6044 wherein 
said fragment does not include one or more of the glycosylation sites identified above: The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6044 are identified in T^le 18. ITie invention includes a 
35 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8487-8665; (b) an amino acid sequence having sequence identity to an amino add 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
40 of such polynucleotides throng viral vectors and/or viral particles. Hie invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8487-8665, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein {e.g. a class I HLA) and a fragment of said antigen; (3) 
45 as an antigen for raising a ceU-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use prefmably protects or treats disease and/or infection caused by a SAKS 
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viius. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immu ne response in a m amm al (typically a 
5 human), comprising the step of administering to the mammal a polypeptide as defined above, 

wherein said immune response is a cell-mediated irmnune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide sequence com prising SEQ ID NO: 6045. The 
invention includes a polypeptide sequence comprising an amino acid sequence having sequence 
10 identity to SEQ ID NO: 6045. The invention includes a polypeptide sequence comprising a 

fragment of SEQ ID NO: 6045. The invention includes a polynucleotide sequence encoding any 
of these polypeptides. 

SEQ ID NO: 6045 demonstrates functional homology with the envelope or small 
membrane protein of coronaviruses.- The invention includes a diagnostic kit comprising' a 
15 polypeptide comprising SEQ ID NO: 6045 or a fragment thereof. The invention includes a 
diagnostic kit comprising a polynucleotide encoding SEQ ID NO: 6045 or a fragment thraeof. 
The invention includes an immunogenic composition comprising SEQ ID NO: 6045 or a' 
fragment thereof. The invention includes an antibody which recognizes a polypeptide ; 
comprising SEQ ID NO: 6045 or a fegment thereof. 

20 Predicted transmembrane regions of SEQ ID NO: 6045 are identified below: 

Inside to outside helices : 1 fovind 

from to score center 

17 ( 19) 33 ( 33) 2881 26 ■ • 

25 Outside to inside helices : 1 foimd 

from to score center 

17 ( 17) 34 ( 34) 2981 27 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
30 6045 wherein said fragment does not include one or more of the hydrophobic amin o acid 

sequences identified above. Preferably, the fragment does not include the amino acids between 
positions 17 to 34. The invention also includes a polynucleotide sequence encoding any of the 
above-identified polypeptides. In one embodiment, the invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6G45 wherein said fragment does not include amino acid 
35 residues 1-34 of SEQ ID NO: 6045. 

Predicted protein Localization Site of SEQ ID NO: 6045 is below. 

PSORT Prediction o£ Protein Iiocalization Sites for SEQ ID NO: 6045 

version 6 . 4 (WWW) 

Species classification: 4 

40 

*** Reasoning Step: 1 
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Preliminary Calcnilation of ALOM (threshold: 0.5) ^ 

count : 2 

Position of the most N~ terminal TMS: 17 at i=l 
5 MTOP: membrane topology (Hartmann et al.) 

I (middle): 24 Charge diffirence(C-N) : 2.0 

McG: Examining signal sequence (McGeoch) 

Length of UR: 29 

Peak Value of UR: 3.40 

10 Net Charge of CR: -2 

Discriminant Score : 13.07 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 4.37 
Possible cleavage site: 32 
15 ... positive value of mtop . . . 

»> Seems to have an uncleavable N-term signal seq. 

Amino Acid Composition of Predicted Mature Form; 
calculated from 1 

ALOM new cnt : 1 * * thrshld changed to -2 
20 Cleavable signal was detected in ALOM? : OB 

ALOM: finding transmembrane regions (Klein et al.) 
count: 1 value: -15.12 threshold: -2.0 

INTEGRiUj Likelihood =-15.12 Transmembrane 17 - 33 ( .8 - 44) 

PERIPHERAL Likelihood = 0.47 

25 modified ALOM score : 3 . 12 

»> Seems to be a Type Ib (Nexo Ccyt) membrane protein 
The cytoplasmic tail is from 34 to 76 (44 Residues) 

Rule: vesicular pathway 
Rule : vesicular pathway 
JO Rule: vesicular pathway 
( 6) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 6 

Uncleavable? Ipos set to: 16 
J5 Discrimination of mitochondrial target seq.: 

notclr (0.19) 

Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

10 

*** Reasoning Step: 2 

> Relative position of the end of the tail: 44% 

Memb. protein with uncleavable signl is often at ER 
1^5 KDEL Coimt : 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 16) from: 70 to: 99 Score; 21.5 
»> Seems to have an intramitochondrial signal 
SKL motif (signal for peroxisomal protein) : 

»0 pos: -1(76), count: 0 

Amxno Acid Composition Tendency for Peroxisome: -4.11 

Peroxisomal proteins? Status: negative 
Amino Acid Conposition tendency for lysosomal proteins 
score: 0.68 Status: notclr 

■5 Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Teirgeting 
Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Check cytoplasmic tail for typelb (plasma memb.) 
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Checking the NPXY motif . . 

Checking the YXRF motif . . 

Checking N-rayristoylation . , 

5 Pinal Results 

plasma xneznbrane Certainty^ 0*730 (Affirmative) < suco 

endoplasmic reticulum (membrane) Certainty* 0.640 (Affirmative) < suco 

endoplasmic reticulum (lumen) Certainty* 0.100 (Affirmative) < suco 

outside Certainty* 0.100 (Affirmative) < suco 

10 

Predicted N-glycosylation sites of SEQ ID NO: 6045 are identified at residues 48 and 66: 

Position Potential Jury NGlyc 

agreement result 

48 NVSL 0.6514 (9/9) ++ (SEQ ID NO: 7266) 

15 66 NSSK 0.5880 (7/9) + (SEQ ID NO: 7267) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6045, wherein said fiagment comprises one or more of the 
N-glycosylation sites identified above. The invention comprises a polypeptide comprising a 
20 fiagment of SEQ ID NO: 6045 wh^in said fragment comprises one or more of the N- 

glycosylation sites identified above. Tbe invention further comprises a polynucleotide encoding 
one or more of the polypeptides identified above. 

Ibe invention includes a polypeptide comprising a fiagment of SEQ ID NO: 6045 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
25 invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6045 are identified in Table 19. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an atnitm acid 
sequence selected fiom the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8666-8820; (b) an amino acid sequence having sequence identity to an amino acid 
$0 sequence of (a). The invention further comprising a polynucleotide sequence trending the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8666-8820, or a polynucleotide encoding such a polypeptide. 

15 The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 

complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
0 immunising a mammal (typicaUy a human) against SARS viral infection wherein the polypeptide 
is as defined above. 
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The invention provides a method of raising an immune response in a mammal (typicaUy a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

5 The invention includes a polypeptide sequence comprising SEQ ID NO: 6046. The 

invention includes polypeptide sequences comprising an amin o acid sequence having sequence 
identity to SEQ ID NO: 6046. The invention includes a polypeptide sequence comprising a 
fragment of SEQ ID NO: 6046. The invention includes a polynucleotide encoding one of these 
polypeptides. 

10 SEQ ID NO: 6046 has fimctional homology with a matrix protein of a coronavirus. The 

invention includes a diagnostic kit comprising a polypeptide comprising SEQ K) NO: 6046 or a 
fragment thereof. The invration includes a diagnostic kit comprising a polynucleotide encoding 
SEQ ID NO: 6046 or a fragment thereof. The invention includes an immunogenic composition 
comprising SEQ ID NO: 6046 or a fragment thereof. The invention includes an antibody which 
15 recognizes a polypeptide comprising SEQ ED NO: 6046 or a fragment thereof. 

Predicted transmembrane regions of SEQ ID NO: 6046 are identified below. 



Inside to outside helices : 3 found 





■ from 






to 


score 


center 


21 


{ 21) 


38 


{ 


36) 


2412 


29 


51 


( 53) 


69 


( 


69) 


2645 


60 


74 


( 82) 


96 


( 


96) 


2464 


89 


Outside to 


inside 


helices : 


3 found 




from 






to 


score 


center 


18 


( 21) 


38 


( 


38) 


2363 


28 


52 


( 52) 


67 


( 


67) 


2363 


60 


76 


( 76) 


95 


( 


92) 


2605 


84 



Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
30 6046 wherein said fragment does not include one or more of the hydrophobic amino acid 

sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected from the group consisting of 18 to 38, 52 to 67 and 76 to 95. The invention 
also includes a polynucleotide sequence encoding any of the above-identified polypeptides. 

Predicted protein localization of SEQ ID NO: 6046 is set forth below. 

35 FSORT Prediction of Protein Localization Sites 

version 6.4 (WWW) 

Species classification: 4 



*** Reasoning Step: 1 

40 

Preliminary Calculation of AIiOM (threshold: 0.5) 
count : 3 

Position of the most N-terminal TMS: 21 at i=l 
MTOP: membrane topology (Hartmann et ai.) 

45 I (middle): 28 Charge diffirence(C-N) : 6.0 
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McG: Examining signal sequence (McGeoch) 

Iiength of UR: 1 

Peak Value of UR: 3.16 

Net Charge of CR: -3 

5 Discriminant Score: 2.21 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 4.29 
Possible cleavage site: 39 
. . . positive value of mtop . . . 

10 »> Seems to have an imcleavable N-term signal seq. 

Amino Acid Composition of Predicted Mature Form: 
calculated from 1 

Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmembrane regions (Klein et al.) 
15 count: 3 value: -7.64 threshold: 0.5 



INTEX3RAL 


Likelihood = 


-7.64 


Transmembrane 


21 - 


37 


( 


18 - 


39) 


INTEC3RAL 


Likelihood = 


-7.59 


Transmembrane 


50 - 


66 


( 


43 - 


72) 


INTEX3RAL 

PERIPHERAL 


Likelihood = 
Likelihood = 


-5.04 

2.38 


Transmembrane 


79 - 


95 


( 


72 - 


99) 



20 modified ALOM score: 2.13 

»> Likely a Type Illb membrane protein (Nexo Ccyt) 

Rule: vesiculcir pathway 
Rule: vesicular pathway 
Rule: vesicular- pathway 
25 ( . 2) or uncleavable? 

CSavel: Examining the boundary of mitochondrial targeting seq. 

‘ motif at: 2 

Uncleavable? Ipos set to: 12 
Discrimination of mitochondrial target seq. : 

30 negative (-4.16) 

Rule : vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

35 *** Reasoning Step: 2 

Type Ilia or Illb is favored for ER memb, proteins 
Memb. protein with uncleavable sign! is often at ER 
KDEL Count : 0 

40 Checking apolar signal for intramitochondrial sorting 

SKL motif (signal for peroxisomal protein) : 
pos: -1(221), count: 0 

Amino Acid Composition Tendency for Peroxisome: 5.01 

Peroxisomal proteins? Status: notclr 
45 Amino Acid Composition tendency for lysosomal proteins 
score: 2.30 Status: positive 

Type III proteins may be localized at Golgi 
Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
50 Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Check the Number of TMSs for typelll (plasma memb.) 

55 Checking N-myristoylation . . 

Final Results -- 

endoplasmic reticulum (mexnbrane) — Certainty^ 0.685 (Affirmative) < suco 

plasma membrEuie Certainty= 0.640 (Affirmative) < suco 

Golgi bo^F — * Certainty^ 0.460 (Affirmative) < suco 

endoplasmic reticulum (lumen) Certainty^ 0.100 (Affirmative) < suco 
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One predicted N-glycosylation sites of SEQ E> NO: 6046 is identified at residue 4: 

Prediction ofN-elvcossiatum sites 

Position Potential Jury NGlyc 
5 agreement result 

4 NGTI 0.8430 (9/9) +++ (SEQ ID NO: 7268) 

Accordingly, the invention conopiises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6046, wherein said fragment comprises the N-^ycosylation site 
10 identified above. TTie invention further comprises a polynucleotide enco ding one or more of the 
polypeptides identified above. 

The invention further comprises a polypeptide comprising a fragment of amin o acid 
sequence SEQ ID NO: 6046, wherein said fragment does not include the N-glycosylation site 
identified above. The invention includes a polynucleotide encoding such a fragment 
15 A variant of SEQ ID NO: 6046 that is included within tire invention is SEQ ID NO: 9963. 

Compared to SEQ ID NO: 6046, this sequence has Val at residue 72 instead of Ala. 

T-epitopes for SEQ ID NO: 6046 are identified in Table 20. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected fiom the group consisting of the T-epitope sequences identified in SEQ ID 
20 NOS: 8821-9018; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
25 ID NOS: 8821-9018, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
Cl’L response. The use preferably protects or treats disease and/or infection caused by a SARS 
30 virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typicaUy a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the m ammal a polypeptide as defined above, 

35 wherein said immune response is a ceU-mediated iimnune response and, preferably, a CTTL 

response. The immime response is preferably protective or therapeutic. 
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Hie invention includes a polypeptide sequence compnsing SEQ ID NO: 6047 or a 
fragment thereof or an amino add sequence having sequence identity thereto. Predicted 
tiansmemhrane regions of SEQ ID NO: 6047 are identified below. 

Inside to outside helices : 2 found 

5 from to score center 

7 ( 10) 29 ( 27) 729 17 

21 ( 24) 41 ( 41) 640 34 

Outside to inside helices : 2 fo\md 

10 from to score center 

4 ( 4) 22 ( 19) 874 12 

22 ( 24) 41 ( 41) 499 31 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
15 6047 wherein said fragment does not include one or more of the hydrophobic amin o add 

sequences identified above. Preferably, the fragment does not include the amin o adds between 
positions selected from the group consisting of 4 to 22 and 22 to 41. The invention also includes 
a polynucleotide sequrace encoding any of the above-identified polypeptides. 

. SEQ ID NO: 6047 is predicted to be a hypothetical protein of SARS virus. A prediction of 
20 the protein localization of SEQ ID NO: 6047 is set forth below. SEQ ID NO: 6047 is predicted 
to be located in one of the following locations: plasma membrane, endoplasmic reticulum, Golgi 
body, and microbody (jperoxisome). SEQ ID NO: 6047 may be associated with an organelle 
inside an infected cell or with viral entry to a host cell. 

. Accordingly, SEQ ID NO: 6047 is a target for screening of chemical inhibitors to the 
25 SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6047 or a fragment ' 
thereof. Hie invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6047 or a fi:agment thereof. The invention includes a method of screening SEQ ID NO: 
6047 for an inhibitor. The invention includes the recombinant expression of SEQ ED NO: 6047 
in a host ceU. The invention includes a small molecule which prevents the polypeptide of SEQ 
30 ID NO: 6047 from associating with an organelle inside of an infected cell or interacting with a 
host cell membrane. Hie invention includes a fusion protein wherein said fusion protein 
comprises SEQ ID NO: 6047. Predicted protein localization of SEQ ID NO: 6047 is set forth 
below. 

PSORT Prediction o£ Protein Ziocalization Sites 

35 version 6 . 4 (WWW) 

Species classification: 4 

*** Reasoning Step: 1 

40 Preliminary Calculation of ALOM (threshold: 0.5) 

count : 1 

Position of the most N- terminal TMS: 2 at i=l 
MTOP: membrane topology (Hartmann et al.) 

I(middle): 9 Charge diffirence(C-N) : 0.5 

McG: Exa m i n ing signal sequence (Mc(3eoch) 

-57- 



45 




wo 2004/092360 



PCT/US2004/011710 



Length of UR: 6 

Peak Value of DR; 3.08 
Net Charge of CR: 0 
Discriminant Score: 5.12 

5 GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): -4.45 
Possible cleavage site: 34 
»> Seems to have an uncleavable N-term signal seq. 

Amino Acid Composition of Predicted Mat\xre Form: 

10 calculated from 1 

ALOM new cnt: 1 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmembrane regions (Klein et al.) 
count : 1 value : -2.44 threshold : -2.0 
1^ INTEGRAL Likelihood = —2.44 Transmembrane 2 — 18 ( 

PERIPHERAL Likelihood = 1.22 

modified ALOM score: 0.59 

»> Seems to be a Type II (Ncyt Cexo) membrane protein 
The cytoplasmic tail is from 1 to 1 (1 Residues) 

20 Rule: vesicular pathway 

R.ule: vesicular pathway 
Rule: vesicular pathway 
( 5) or iincleavable? 

Gavel: Examining the boundary of mitochondrial targeting seg. 

25 motif at: 5 

Uncleavable? Ipos set to: 15 
Discrimination of mitochondrial target seq. : 
notclr (1.48) 

Rule: vesicular pathway 
30 Rule: vesicular pathway 

Rule: vesicular pathway 

*** Reasoning Step: 2 

^5 Relative position of the cytoplasmic tail : 1% 

Larger value (>30%) is favared for ER memb. proteins 

Memb. protein with uncleavable signl is often at ER 
KDEL CoTint : 0 

Checking apolar signal for intramitochondrial sorting 
to (Gavel position 15) from: 64 to: 93 Score: 30.0 

>» Seems to have an intramitochondrial signal 
SKL motif (signal for peroxisomal protein) : 
pos: -1(63), count: 0 

Amino Acid Composition Tendency for Peroxisome: 1.91 

t5 Peroxisomal proteins? Status: notclr 
AAC score (peroxisome): 0.161 
Amino Acid Composition tendency for lysosomal proteins 
score: 0.04 Status: notclr 

Checking the consensus for Golgi 
iO Checking the consensus for Golgi 

Checking the cytoplasmic tail of type II. (Golgi) 

Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for Nuclear Targeting 
^5 Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative (0.00) 

Check mitochondrial signal for typell (plasma memb. ) 

Type II is favored for plasma memb. proteins 
0 Checking the NPXY motif.. 

Checking the YXRF motif .. . 



1 - 20 ) 
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Checking N-nyristoylation . . 

Pinal Results 

plasma membrane — Certainty^ 0.685 (Affirmative) < suco 
5 endoplasmic reticulum ( membr ane ) — - Certainty^ 0.640 (Affirmative) < suco 

Golgi body Certcdmtys 0.370 (Affirmative) < suco 

microbocty (peroadLsoase) Certainty^ 0.161 (Affirmative) < suco 

T-epitopes for SEQ ID NO: 6047 are identified in Table 21. The invention includes a 
10 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9019-9131; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). Tlie invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
15 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9019-9131, or a polynucleotide encoding such a polypeptide. 

The use as an anti^n is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protdm {e.g. a class I HLA) and a fingment of said antigen; (3) 
20 as an antigen for raising a cell-mediated immu ne response; and/or (4) as an antigen for raising a = 
CYL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. TTie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

25 The invention provides a method of raising an immune response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTTL 
response. The immime response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6048, a fragment thereof or 
30 an amino acid sequence having sequence identity thereto. Predicted transmembrane regions of 
SEQ ID NO: 6048 are identified below. 

Inside to outside helices : 2 found 

from to score center 

3 { 3) 18 ( 18) 1857 10 

35 100 ( 100) 117 ( 115) 2904 107 

Outside to inside helices : 2 found 

from to score center 

1 { 1) 15 ( 15) 1299 8 

40 100 ( 100) 117 { 115) 3009 107 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6048 wherein said fragment does not include one or more of the hydrophobic amino acid 
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5 



10 



15 



!0 



sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected fiom the group consisting of 1 to 15 and 100 to 117. The invention also 
includes a polynucleotide sequence encoding any of the above-identified polyiwptides. 

SEQ ID NO: 6048 is predicted to be a hypothetical protein of SAKS vims. A prediction of 
the protein localization of SEQ ID NO: 6048 is set forth below. SEQ ID NO: 6048 is predicted 
to be located in one of the following locations: plasma membrane, lysosome (membrane), 
microbody (peroxisome), and endoplasmic reticulum (membrane). SEQ ID NO: 6048 may be 
associated with an organelle inside an infected ceU or may interact with a host ceU plasma 
membrane during viral entry to the host cell. 

Accordingly, SEQ ID NO: 6048 is a target for screening of chemical inhibitors to the 
SAKS virus. The invention includes a polypeptide comprises SEQ ID NO: 6048 or a fiagment . 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6048 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 

6048 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6048 
in a host cell, nie invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6048 fiom associating with an organeUe inside of an infected cell or prevents the 
polypeptide fiom associating with the cell membrane of a host ceU. The invention includes a 
fusion protein wherein said fusion protein comprises SEQ ID NO: 6048. Predicted protein 
localization of SEQ ID NO: 6048 is set forth below. 



PSORT — Prediction of Protein Localization Sites 



Species classification: 4 



version 6 . 4 (WWW) 



:5 



0 



5 



3 



5 



*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
coimt : 2 

Position of the most N-terminal TMS: 3 at i=2 
MTOP: membrane topology (Hartmann et al.) 

I (middle): 10 Charge diffirence(C-N) : ~2.5 
McG: Examining signal sequence (McGeoch) 

Length of UR: 13 

Peak Value of UR: 3.38 

Net Charge of CR: 1 
Discriminant Score: 10.02 

C5vH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): 2.56 
Possible cleavage site: 15 
»> Seems to have a cleavable N-term signal seq. 

Amxno Acid Conqposition of Predicted Mature Form: 
calculated from 16 

ALOM new cnt: 2 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: IB 
ALOM: f in ding transmembrane regions (Klein et al.) 
count: 1 value: -14.75 threshold: - 2.0 

INTEGRAL Likelihood =-14.75 Transmembrane 101 - 117 ( 95 - 190 ^ 

PERIPHERAL Likelihood =6.63 ^ 

modified ALOM score: 3.05 
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»> Seems to be a Type la membrane protein 

The cytoplasmic tail is from 118 to 122 (5 Residues) 
Rule : vesicular pathway 
Rule: vesicular pathway 
5 Rule: vesicular pathway 

(15) or uncleavable? 

Gavel: Exa min i n g the boundary of mitochondrial targeting seg 
motif at: 15 

Ctacleavable? Xpos set to: 25 
10 Discrimination of mitochondrial target seq. : 

notclr ( 0.73) 

Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

*** Reasoning Step: 2 



20 



25 



JO 



J5 



10 



Final Results 

plasma membrane Ceztaintys 0.919(Afflxinative) < suco 

)5 lysososna (membrane) Certainty^ 0.2 00 (Affirmative) < suco 

miorobo^ (perosiscane) — Certainty^ 0.115 (Affirmative) < suco 
endoplasmic reticulum (membrane) Certainty^ 0.100 (Affirmative) < suco 

T-epitopes for SEQ ID NO: 6048 are identified in Table 22. The invention includes a 
® polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected fiom the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9132-9308; (b) an amino acid sequence having sequence identity to an amino acid 
sequraice of (a). Tlie invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
5 of such polynucleotides through viral vectors and/or viral particles. The invention further 



KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 25) from: 3 to: 12 Score: 8.5 

SKL motif (signal for peroxisomal protein) : 
pos: -1(122), count: 0 

Amino Acid Conposition Tendency for Peroxisome: 2.46 

AAC not from the N-term. , score modified 
Peroxisomal proteins? Status: notclr 
AAC score (peroxisome): 0.115 
Amino Acid Conqposition tendency for lysosomal proteins 
score: -0.40 Status: negative 
GY motif in the tail of typela? (lysosomal) 

Checking the amount of* Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 

Type la is favored for plasma memb. proteins 
Checking the NPXY motif.. 

Chec)cing the YXRF motif.. 

Checking N-nryristoylation. . 

Checking GPI anchor. . 

>» Seems to be GPI-anchored (0.85) 
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comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9132-9308, or a polynucleotide encoding such a polypeptide. 

Hie use as an antigen is preferably a use: (1) as a T-ceU antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fiagment of said antigen; (3) 

5 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 

CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. Tbe invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

10 The invention provides a method of raising an immune response in a maimnai (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6049, a fiagment thereof or 

15 an amino acid sequence having sequence identity thereto. Predicted transmembrane or 
hydrophobic regions of SEQ ID NO: 6049 are identified below. 

Inside to outside helices : 1 found 

from to score center 

13 { 13) 30 { 28) 3532 20 

20 

Outside to inside helices : 1 found 

from to score center 

9 ( 11) 29 ( 26) 3395 19 

25 Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 

6049 wherein said fiagment does not include one or more of the hydrophobic amino acid 
sequences identified above. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

SEQ ID NO: 6049 is predicted to be a hypothetical protein of SARS virus. A prediction of 

30 the protein localization of SEQ ID NO: 6049 is set forth below. SEQ ID NO: 6049 is predicted 
to be located in one of the following locations: outside, microbody (peroxisome), endoplasmic 
reticulum (membrane) and endoplasmic reticulum Oumen). The highest ranking indicates that 
SEQ ID NO: 6049 is located on the outside of a cell. Accordingly, SEQ ID NO: 6049 ma y be a 
surface exposed protein. 

35 Accordingly, SEQ ID NO: 6049 may be used in an immunogemc composition to raise an 

immune response against the SARS virus. It also may be used to generate antibodies specific to 
the SARS virus. Such antibodies may be used in a method of treatment or prevention of a SARS 
virus infection. Such antibodies may further be used in a diagnostic test to identify the presence 
or absence of SARS virus in a biological sample. 
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The invention includes a polypeptide comprises SEQ ID NO: 6049 or a fragment thereof. 
The invention includes a polynucleotide encoding the polyp^tide sequence of SEQ ID NO: 

6049 or a fragment thaeof. The invention includes a method of screening SEQ ID NO: 6049 for 
an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6049 in a host 
5 cell. The invention includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 
6049. Predicted protein localization of SEQ ID NO: 6049 is set forth below. 

PSORT Prediction of Protein Localization Sites 

version 6. 4 (WWW) 

Species classification: 4 

0 

*** Reasoning Step: 1 



15 



20 



25 



50 



J5 



\0 



5 



0 



5 



Preliminary Calculation of ALOM (threshold: 0.5) 
count : 1 

Position of the most N-tenninal TMS: 11 at i=l 
MTOP: membrane topology (Hartmann et ai.) 

I (middle): 18 Charge diff irence (C-N) : -2.0 
McG: Examining signal sequence (McGeoch) 

Length of UR: 24 

Peak Value of UR: 3.69 

Net Charge of CR: -2 
Discriminant Score: 13.56 

GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5); 0.52 
Possible cleavage site: 25 
»> Seems to have a cleavable N-term signal seq. 

Amino Acid Contposition of Predicted Mature Form: 
calculated from 26 

ALOM new cnt: 1 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : IB 
ALOM: finding transmembrane regions (Klein et al.) 
count: 0 value: 14.80 threshold: -2.0 

PERIPHERAL Likelihood = 14.80 
modified ALOM score; -3.86 
Rule : vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 
( 2) or uncleavable? 

(5avel: Examining the boundary of mitochondrial targeting seq. 
motif at: 2 

Uncleavable? Ipos set to: 12 
Discrimination of mitochondrial target seq. : 
notclr ( 1.42) 

Rule : vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

*** Reasoning Step: 2 

KDEL Count ; 0 

Number of Potential N-glycosylation Sites: 0 
Out: score 0.800 

Checking apolar signal for intreunitochondrial sorting 
(Gavel position 12) from: 44 to: 73 Score: 30.0 
»> Seems to have an intramitochondrial signal 
SKL motif (signal for peroxisomal protein) : 
pos: -1(44), count: 0 
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Amino Acid Composition Tendency for Peroxisome: 9.47 

AAC not from the N-tenn. , score modified 
Peroxisomal proteins? Status: notclr 
AAC score (peroxisome): 0.320 

5 Amino Acid Composition tendency for lysosomal proteins 
score: -6.47 Status: negative 
Number of NX(S/T) motif: 0 

Chec)d.ng the amount of Basic Residues (nucleus) 

Checlcing the 4 residue pattern for Nuclear Tcurgeting 
10 Checlcing the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Checlcing the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Checlcing CaaX motif. . 

15 Checking N-myristoylation. . 

Checking CaaX motif . . 

Final Results 

outside Certainty^ 0.820 (Affirmative) < suco 

20 microbody (peroxisoane) Certainty^ 0.320 (Affirmative) < suco 

endoplasmic reticnilum (membrane) — — — Certainty^ 0.100 (Affirmative) < suco 
endoplasmic reticulum (lumen) Certainty^ 0.100 (Affirmative) < suco 

T-epitopes for SEQ ID NO: 6049 are identified in Table 23. The invootion includes a 
25 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino add 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9309-9437; (b) an amino add sequence having sequence identity to an amino add 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
30 of such polynucleotides througli viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9309-9437, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex betwerai a class I MHC protdn (e.g. a class I HLA) and a fragment of said antigen; (3) 
35 as an antigen for raising a cell-mediated immu ne response; and/or (4) as an antigen for raising a 

CTL response. The use preferably protects or treats disease and/or infection caused by a SAKS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

40 The invention provides a method of raising an imm une response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, pref^bly, a CTL 
response. The immune response is preferably protective or therapeutic. 
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Hie invention includes a polypeptide comprising SEQ ID NO: 6050 or a fragment thmeof 

or an amino acid sequence having sequence identity thereto. Predicted transmembrane or 

hydrophobic regions are identified below. 

Inside to outside helices : 1 found 

5 from to score centef* 

13 ( 15) 32 { 30) 558 23 

Outside to inside helices : 1 found 

from to score center 

10 16 ( 16) 30 ( 30) 364 23 



15 
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Accordingly, the invention includes a polypeptide conqnising a fragment of SEQ ID NO: 
6050 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

SEQ ID NO: 6050 is predicted to be a hypothetical protein of SARS virus. A prediction of 
the protein localization of SEQ ID NO: 6050 is set forth below. SEQ ID NO: 6050 is predicted 
to be located in one of the following locations: lysosome Qumen), mitochondrial matrix space, 
mitochondrial inner membrane, and mitochondrial intermembrane space. SEQ ID NO: 6050 
may be associated with an organelle inside an infected cell durmg the viral replication cycle. 

Accordingly, SEQ ID NO: 6050 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6050 or a fiagment 
thereof. Hie invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6050 or a fragment thCTeof. The invention include a method of screening SEQ ID NO: 

6050 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6050 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6050 fiom associating with an organelle inside of an infected cell or prevents the 
polypeptide from associating with the cell membrane of a host cell. Hie invention includes a 
fusion protein wherein said fusion protein comprises SEQ ID NO: 6050. Predicted protein 
localization of SEQ ID NO: 6050 is set forth below. 



PSORT Prediction of Protein Localization Sites 



MYSEQ 84 Residues 

Species classification: 4 



version 6.4 (WWW) 



*** Reasoning Step: 1 



Preliminary Calculation of MjOJA (threshold: 0.5) 
count : 0 

McG: Examining signal sequence (McGeoch) 

Length of UR: 3 

Peak Value of DR; 1.46 
Net Charge of CR: 2 
Discriminant Score : -5.73 

GvH; Examining signal sequence (von Heijne) 
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Signal Score (-3.5): -0.12 
Possible cleavage site: 29 
»> SeCTis to have no N- terminal signal seq. 

Amino Acid Composition of Predicted Mature Form: 

5 calcnilated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in AIiOM?: OB 
AIiOM: finding transmembrane regions (Klein et al.) 
count: 0 value: 8.43 threshold: -2.0 

10 PERIPHERAL Likelihood = 8.43 

modified ALOM score: -2.59 

Gavel: E xam i n i n g the boundary of mitochondrial tsorgeting seq. 
motif at: 61 
ARCWYL 

15 Discrimination of mitochondrial target seg. : 

positive ( 1.66) 

Rule : mitochondrial protein 
Rule: mitochondrial protein 
Rule: mitochondrial protein 
20 Rule: mitochondrial protein 

*** Reasoning Step: 2 

KDEL Coiint: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 61) from: 52 to: 58 Score: 6.0 

Mitochondrial matrix? Score: 0.38 

SKL motif (signal for peroxisomal protein): 
pos: -1(84), count: 0 

Amino Acid Composition Tendency for Peroxisome: 1.47 

Peroxisomal proteins? Status: notclr 
AAC score (peroxisome): 0.263 
Amino Acid Composition tendency for lysosomal proteins 
score: 2.86 Status: positive 

Modified score for lysosome: 0.850 
Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
Checking, the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Checking CaaX motif . . 

Checking N-inyristoylation. . 

Checking CaaX motif. . 

Final Results 

lysosome (lumen) Certaintys 0.850 (Affirmative) < suco 

mitochondrial matrix space - — Certainty= 0.544 (Affirmative) < suco 

mitoch on d r ial inner membrane Certainty^ 0.266 (Affirmative) < suco 

mitoc h o nd rial intermembrane space Certaintya 0.266 (Affirmative) < suco 

One predicted N-glycosylation sites of SEQ ID NO: 6050 is identified at residue 43: 

Position Potential J\iry NGlyc 

agreement result 
55 43 NVTI 0.6713 (9/9) ++ (SEQ ID NO: 7269) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6050 wherein said fragment comprises the N-glycosylation site 
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identified above. ITie invention further compile a polynucleotide encoding one or more of the 
polypeptides identified above. 

The invention further con^nises a polypq)tide comprising a fragment of amino acid 
sequence SEQ ID NO: 6050 wherein said fragment does not include the N-glycosylation site 
5 identified above. The invention includes a polynucleotide encoding such a fingmenL 

T-epitopes for SEQ ID NO: 6050 are identified in Table 24. TTie invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected fiom the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9438-9538; (b) an amino acid sequence having sequence identity to an amin o acid 
10 sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9438-9538, or a polynucleotide encoding such a polypeptide. ' 

15 The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 

complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immime response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
20 immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immu ne response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
25 response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide sequence comprising SEQ ID NO: 6051 or a 
fiagment thereof or an amino acid sequence having sequence identity thereto. The invention 
includes a polypeptide sequ^ce comprising SEQ ID NO: 6052 or a fragment thereof or an 
amino acid sequence having sequence identity thereto. 

10 SEQ ID NO: 6051 and SEQ ID NO: 6052 demonstrate functional homology with a 

nucleocapsid protein of a coronavirus. The invention includes a diagnostic kit comprising a 
polypeptide comprising SEQ ID NO: 6051, SEQ ID NO: 6052 or a fragment thereof. The 
invention includes a diagnostic kit comprising a polynucleotide encoding SEQ ID NO: 6051, 

SEQ HD NO: 6052 or a fiingment thereof. The invention includes an inununogenic composition 
15 comprising SEQ ID NO: 6051, SEQ ID NO: 6052 or a fragment thereof. The invention includes 
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an antibody which recognizes a polypeptide comprising SEQ ID NO: 6051, SEQ ID NO: 6052 
or a fragment thereof. 

SEQ ID NO: 6051 is predicted to be phosphorylated at Ser-79; Thr-92; Ser-106; “nir-lld; 
Thr-142; Ser-184; Ser-188; Ser-202; Ser-236; Thr-248; Ser-251; Ser-256; Thr-377. 

5 Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 6051 

wherein said fragment includes one or more of the amin o acid residues of SEQ ID NO: 6051 

selected from the group consisting of Ser-79; ’Ilir-92; Ser-106; Thr-116; Thr-142; Ser-184; Ser- 

188; Ser-202; Ser-236; Thr-248; Ser-251; Sct- 256; Thr-377. The invention further includes a 

polypeptide comprising a fragment of SEQ ID NO: 6051 wherein said fragment does not include 

10 one or more of the apoino acid residues of SEQ ID NO: 6051 selected from the group consisting 

of Ser-79; Thr-92; Ser-106; Thr-116; Thr-142; Ser-184; Ser-188; Ser-202; Ser-236; Thr-248; 

Ser-251; Ser-256; Thr-377. Two further useful fragments of the N protein (e.g. for 

immunoassay) are SEQ ID NOS: 9783 & 9784, which are lysine-rich and can be used to 

distinguish the SARS virus from other coiona;viruses. 

15 Predicted transmembrane regions of SEQ ID NO: 6051 are identified below. 

Inside to . outside helices ; 1 found 

from to score center 

304 ( 304) 323 ( 319) 495 312 

20 Outside to inside helices : 2 found 

from to score center 

304 ( 304) 319 ( 319) 597 312 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
25 605 1 wherein said fragment does not include one or more of the hydrophobic amino add 

sequences identified above. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

Predicted protein localization of SEQ ID NO: 6051 is set forth below. SEQ ID NO: 6051 
is predicted to be localized near the nucleus, lysosome (lumen), mitochondrial mahix space, and 
30 microbody (peroxisome). The highest ranking is for localization near the nucleus. Coronavirus 
nucleocapsid proteins are known to bind to viral RNA. Coronavirus nucleocapsid proteins are 
also tbou^t to be important for ceU mediated immunity. Accordingly, the invention includes a 
polynucleotide comprising SEQ ID NO: 6051. Hie invention further includes a viral vector or 
particle suitable for in vivo delivm^ of the polynucleotide sequence comprising a SARS vims 
35 nuceocapsid polynucleotide sequence or a firagment thereof. In one embodiment, the 

polynucleotide comprises SEQ ID NO: 6051 or a fragment thereof. The invention further 
includes a method for eliciting a cell mediated immune response comprising delivering a 
polynucleotide encoding a SARS virus nucleocapsid protein or a fragment thereof to a mammal . 
Li one embodiment, the polynucleotide comprising SEQ ID NO: 6051 or a firagment thereof. 
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The invention further includes a method of screening SEQ ID NO: 6051 for an inhibitor. 
Hie invention includes the recombinant expression of SEQ ID NO: 6051 in a host ceU. Hie 
invention includes a small molecule which prevents the polypeptide of SEQ ID NO: 6051 ftom 
binding to SAKS virus RNA during viral replication. The invention includes a fusion protein 
5 whaein said fusion protein comprises SEQ ID NO: 6051. Predicted protein localization of SEQ 
ID NO: 6051 is set forth below. 

MOKS Prediction of Protein Localization Sites 

version 6 . 4 (WWW) 

Species classification: 4 

0 

*** Reasoning Step: 1 



Preliminary Calculation of ALOM (threshold: 0.5) 
count : 0 

15 McG: E x a m i n ing signal sequence (McGeoch) 

Length of UR: 3 

Peak. Value of UR: 0.19 

Net Charge of CR: 0 
Discriminant Score: -15.98 

20 GvH: Examining si^al sequence (von Heijne) 

Signal Score (-3.5): -6.36 
Possible cleavage site: 58 
»> Seems to have no N-teonninal signal seq. 

Amino Acid Conposition of Predicted Mature Form: 

25 • calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: OB 
ALOM: finding transmembrane regions (Klein et ai.) 
coimt: 0 value: 5.04 threshold: -2.0 

30 PERIPHERAL Likelihood = 5.04 

modified ALOM score: -1.91 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 17 
PRITPG 

35 Discrimination of mitochondrial target seq. : 
negative (-3.97) 



*** Reasoning Step: 2 
JO KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 
Mitochondrial matrix? Score: 0.10 

SKL motif (signal for peroxisomal protein) : 
pos: -1(399), count: 0 

1^5 Amino Acid Composition Tendency for Peroxisome: 0.04 

Peroxisomal proteins? Status: notclr 
AAC score (peroxisome) : 0.072 
Amino Acid Composition tendency for lysosomal proteins 
score: 0.96 Status: notclr 

>0 Modified score for lysosome: 0.246 

Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Found: pos: 256 (4) KKPR 
Found: pos: 372 (5) KKKK. 

Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
Found: pos: 372 (3) KK KKTDEAQPLP QRQKK 
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Found: pos; 373 (3) KK KTDEAQPLPQ RQKKQ 

Pinal Robbins Score (nucleus): 0.80 

Checking the RNR binding motif (nucleus or cytoplasm) 
nuc modified. Score: 0.90 

Nuclear Signal Status: positive ( 0.90) 

Checking CaaX motif. . 

Checking N-rayristoylation. . 

Checking CaaX motif. . 



10 



15 



20 



25 



Final Results 

nucleus Certainty= 0.980(A££irmatlve) < suco 

lysosome (lumen) Certainty^ 0.246(A££innative) < suco 

mitochondrial matrix ^ace Certainty^ 0.100 (A££irmative) < suco 

microbody (peroxisome) Certainty^ 0.072 (Affirmative) < suco 

Predicted N-glycosylation sites of SEQ ID NO: 6051 are identified below. 

Position Potential Jury NGlyc 

agreement result 

48 NNTA 0.6879 (9/9) ++ (SEQ ID NO; 7270) 

270 NVTQ 0.7684 (9/9) +++ (SEQ ID NO: 7271) 



Residue No . Potential Threshold Assignment 



Thr 


166 


0.8547 


0.6439 


T 




Thr 


367 


0.5575 


0.5403 


T 




Thr 


394 


0.8217 


0.5821 


T 





Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
add sequence of SEQ ID NO: 6051 wharein said fragment comprises one or more of the N- 
glycosylation sites identified above. The invention further comprises a polynucleotide encoding 
)0 one or more of the polypeptides identified above. 

The invention further comprises a polypeptide comprising a fragment of amino acid 
sequence SEQ ID NO: 6051 wherein said fragment does not include one or more of the 

N-glycosylation sites identified above. The invention includes a polynucleotide encoding such a 
fragment 

^ for SEQ ID NO: 6052 are identified in Table 25. The invention includes a 

polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amiun acid 
sequence selected fiom the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9539-9752; (b) an amino acid sequence having sequence identity to an amino add 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
0 polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. ITie invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9539-9752, or a polynucleotide encoding such a polypeptide. 

A variant of SEQ ID NO: 6052 that is included within the invention is SEQ ID NO: 9964. 

5 Compared to SEQ ID NO: 6052, this sequence has Ee at residue 54 instead of Thr. 



- 70 - 




wo 2004/092360 



PCT/US2004/011710 



The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for genaating a 
complex between a class I MBC protein {e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. Hie use preferably protects or treats disease and/or infection caused by a SARS 
5 virus. Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immu ne response in a mammal (typically a 
human), comprising the step of administering to die mammal a polypeptide as defined above, 

10 wherein said inomune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a composition comprising a SARS virus nucleocapsid protein or a 
fragment thereof and further comprising a SARS virus membrane protein or a fragment thereof. 
The composition may further comprising one or more adjuvants discussed below. 

15 The invention further includes a composition comprising a polypeptide comprising SEQ 

ID NO: 6051 or a fragment thereof or a sequence having sequence idraitity thereto and further 
comprising a polypeptide comprising SEQ ID NO: 6040, or a fragment thereof or a sequence 
having sequence identity thereto. Such composition may be used, for instance, in a vaccine. 

Such composition may further comprise one or more adjuvants discussed below. 

20 The invention includes a composition comprising a SARS virus nucleocapsid protein or a 

fragment thereof and a SARS virus spike protein or a fragment thereof. In one embodiment the 
nucleocapsid protein comprises a polypeptide sequence comprising SEQ ID NO: 605 1 or a 
fragment thereof or a sequence having sequence identity thereto, fii one embodiment, the spike 
protein comprises a polynucleotide comprising SEQ ID NO: 6042 or a fiagment thereof or a 
25 sequence having sequence identity thereto. The composition may further comprise one or more 
of the adjuvants discussed below. 

The invention further includes a composition comprising antibodies specific to a SARS 
vims nucleocapsid protein and comprising antibodies specific to a SARS virus spike protein. In 
one embodiment the antibody is specific to a nucleocapsid protein corrqirises a polypeptide 
JO sequence comprising SEQ ID NO: 605 1 or a fragment thereof or a sequence having sequence 

identity thereto. In one embodiment, the antibody is is specific to a spike protein comprises a 
polynucleotide comprising SEQ ID NO: 6042 or a fragment thereof or a sequence having 
sequence identity thereto. 

The invention further includes polynucleotide sequences, and fragments thereof, of a 
J5 SARS virus which are conserved among coronaviruses, and polypeptides encoded thereby. Such 
conserved sequences can be identified in the alignmraits shown in FIGURE 7. Such conserved 
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sequences may be used in the vaccines of the invention or in the diagnostic reagents, kits and 
methods of the invention. 

The invention further includes polynucleotide sequenices, and fragments thereof, of a 
SARS virus which are specific to SARS virus and not shared with coronaviruses. Such SARS 
5 specific sequwices are also identified as SEQ ID NOS: 6040, 6043, 6044, 6047, 6048, 6049 and 
6050. Such SARS specific sequences may be used in the vaccines of the invention or in the 
diagnostic reagents, kits and methods of the invention. 

Ihe invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
10 to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 

invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified inSEQ ID NOS: 6076-6265 (Table 5). The invention further includes polynucleotide 
sequence comprising the complement of one or mote of the primer sequences identified in SEQ 
ID NOS: 6076-6265. 

15 The invention also includes polynucleotide sequences which can be used as probes or 

prirhers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
invention includes a polynucleotide sequence comprising one or mote of the primer sequences 
identified in SEQ ID NOS: 6266-6343 (Table 6). The invention further includes polynucleotide 
20 sequence comprising the complement of one or more of the primer sequences identified in SEQ 

ID NOS: 6266-6343. 

The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
25 invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6344-6392 (Table 7). The invention further includes polynucleotide 
sequence comprising the complement of one or more of the primer sequences identified in SEQ 
ID NOS: 6344-6392.. 

The invention also includes polynucleotide sequences which can be used as probes or 
10 primers for diagnostic reagents, kits (corrqjrising such reagents) arid methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. Die 
invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6393-6559 (Tables 8 & 9). The invention further includes 
polynucleotide sequence comprising the complement of one or more of the primer sequences 
>5 identified in SEQ ID NOS: 6393-6559. 
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The invention also includes polynucleotide sequ^ces which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a S ARS virus in a biological samp le "nie 
invention includes a polynucleotide sequence comprising one or more of the primer and probe 
5 sequences identified in SEQ ID NOS: 6560-6568. The inv^tion further includes polynucleotide 
sequence con^)rising the con:q)lement of one or more of the primer sequences identified in SEQ 
ID NOS: 6560-6568. 

Die invention includes a polypeptide sequence con^nising any one of even-numbered SEQ 
ID NOS: 7272-7290, or a fragment thereof, or a sequence having sequence identity thereto. The 
10 invention finther includes a polynucleotide sequence encoding any one of even-numbered SEQ 
ID NOS: 7272-7290, or a fragment thereof, or a sequence having sequence identity thereto. 
Examples of such polynucleotide sequences are odd-numbered SEQ ID NOS: 7273-7291. 

The invention includes a polynucleotide sequence comprising an int^genic sequence 
which is common to each open reading finme of the SARS virus. The SARS virus is thought to 
15 use this sequence to signal translation of the open reading frame. The intergenic sequence 
comprises a IQmer SEQ ID NO: 7292, or optionally a hexamer SEQ ID NO: 7293. When the 
virus transcribes its positive (+) RNA strand to (-) RNA strand, the virus repUcating stracture 
uses the (-) strand template to transcribe nucleotides at the 5' end prior to the first intergenic 
sequence, followed by the intergenic sequence, foUowed by the selected open reading frame. 
iO Die virus flien creates multiple mRNAs comprising the 5' end, the intergenic sequence and 
coding sequence. For more details on Nidovriales replication (including Coronavims) see e.g., 
^ebuhr et al., “Virus-encoded proteinases and proteolytic processing in the Nidovirales”, 

Journal of General Virology 81:853-879 (2000), incorporated herein by reference in its entirety. 

Die invention comprising a polynucleotide sequence comprising SEQ ID NO: 7292 or the 
5 complement thereof. The invention comprising a polynucleotide sequence comprising SEQ ID 
NO. 7293 or the complement thereof. The invention further comprises a polynucleotide 
sequence comprising nucleotides from the 5’ end of the SARS viral genome, or its reverse 
complement, and further comprising an intergenic sequence or its reverse complement The 
polynucleotide may further comprise one or more of the SARS virus open reading frames. 

0 Examples of polynucleotide sequences comprising nucleotides finm the 5' end of the SARS virus 
genome followed by the intergenic sequence are SEQ ID NOS: 7294-7301.- 

The invention includes a polynucleotide sequence comprising a sequence selected from the 
group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 

7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID 
5 NO: 7300 and SEQ ID NO: 7301, or a fragment thereof, or a sequence having sequence identity 
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fliereto. Li one embodiment, the polynucleotide does not consist entirely of a known S ARS virus 
sequence. 

The SARS virus intei]genic sequence can be used to create a RNAi molecule. Such a 
SARS virus specific RNAi molecule can be used to treat SARS virus infection. Hie invention 
5 includes a RNAi molecule comprising a double stranded RNA molecule wherein one RNA 

strand comprises a sequence selected from the group consisting of SEQ ID NO: 7292, SEQ ID 
NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ 
ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301, or a fragment 
thereof. Preferably, said RNA strand comprises a sequence selected from the group consisting of 
10 SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the other RNA strand comprises the 

reverse complement of the first strand or a polynucleotide sequence which hybridizes to the first 
strand. 

The invention includes the use of RNAi in a method of treatment for SARS virus infection 
comprising administering to a mammal an effective amount of the si RNA molecule. Preferably, 
15. the RNAi molecule comprises the molecule described above. Further discussion of the RNAi 
applications of the intergenic sequence is included in section IV of the specification below. 

The invention also includes the use of a SARS virus antisense nucleotide sequence, 
preferably antisense directed to the SARS virus intergenic sequence. Such an antisense sequence 
may be used in the treatment of a subject infected with the SARS virus. The antisense of the 
20 SARS virus intergenic sequence can be designed to bind to the SARS viral polynucleotides to 
block access of the viral replication machinery to the int^genic sequence. Such an- antisense 
• sequence may also be used to identify the presence or absence of a SARS virus in a biological 
sample. The antisence can itself be labeled or the antisense associated with viral polynucleotides 
can be detected by means known in the art. 

25 Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 

formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense polynucleotides based on a selected 
sequence can interfere with expression of the corresponding gene. Antisense polynucleotides 
will bind and/or interfere with the translation of the corresponding mRNA. 

^0 The invention also includes the use of the inter^nic region with a ribozyme. 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing 
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and the 
target message must contain a specific nucleotide sequence. They are engineered to cleave any 
RNA species site-specifically in the background of cellular RNA. The cleavage event renders the 
J5 mRNA unstable and prevents protein expression. Importantly, ribozymes can be used to inhibit 
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expression of a gene of unknown function for the purpose of detomining its function in an in 
vitro or in vivo context, by detecting the phenotypic effect 

One commonly used libozyme motif is die hammer head, for which die substrate sequence 
requirements are minimal. Design of the hamme rhead ribozyme is disclosed in Usman et aL, 

5 Current Opin. Struct. Biol. (1996) 6:527-533. Usman also discusses the therapeutic uses of 
ribozymes. Ribozymes can also be prepared and used as described in Long et oL, FASEB J. 
(1993) 7:25; Symons, Ann. Rev. Biochem. (1992) 61:641; Perrotta et al., Biochem. (1992) 31:16- 
17; Ojwang et al., Proc. Natl. Acad. ScL (USA) (1992) 89:10802-10806; and US Patent 
5,254,678. Ribozyme cleavage of HIV-I RNA is described in US Patent 5,144,019; methods of 
10 cleaving RNA using ribozymes is described in US Patent 5,1 16,742; and methods for increasing 

the specificity of ribozymes are described in US Patent 5,225,337 and Koizumi et al. , Nucleic 
Add Res. (1989) 17:7059-7071. Preparation and use of ribozyme fragments in a hammerhead 
structure are also described by Koizumi et al., Nucldc Acids Res. (1989) 17:7059-7071.: 
Preparation and use of ribozyme fragments in a hairpin structure are described by Chowrira & 

15 Burke, Nucleic Acids Res. (1992) 20:2835. Ribozymes can also be made by rolling transcription 
as described inDaubendiek & Kool, Nat. Biotechnol. (1997) 15(3):273-277. 

• • The hybridizing region of the ribozyme may be modified or may be prepared as a branched 
structure as described in Horn & Urdea, Nucleic Acids Res. (1989) 17:6959-67. Tlie basic 
structure of the ribozymes may also be chemically altered in ways familiar to those skilled in the 
20 art, and chemically synthesized ribozymes can be administered as synthetic oligonucleotide 
derivatives modified by monomeric units. In a therapeutic context, liposome mediated delivery 
of ribozymes. improves cellular uptake, as described in Birildi et al.. Ear. J. Biochem. (1997) 
245:1-16. 

Therapeutic and functional genomic applications of ribozymes proceed begirming with 
25 knowledge of a portion of the coding sequence of the gene to be inhibited, fii the present- 

invention, the target sequence preferably comprises the intergeneic sequence of the SARS virus. 
Preferably, the sequence is selected from the group consisting of SEQ ID NO: 7292 and SEQ ID 
NO: 7293. A target cleavage site is selected in the target sequence, and a ribozyme is 
constmcted based on the 5' and 3* nucleotide sequences that flank the cleavage site. Preferably, 
30 the 5’ nucleotide sequence includes the 5’ untranslated region of the SARS virus. The ribozyme 
may then further be constructed from one or more of the polynucleotide sequences selected from 
the group consisting of SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 
7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. 

Antisense treatment of HIV infection is described in the following references, each of 
35 which is incorporated herein by reference in their entirety, (antisense RNA complementary to 
the mRNA of gag, tat, rev, env) (Sezakiel et at, 1991, J. Virol. 65:468-472; Chatteijee et aL, 
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1992, Science 258:1485-1488; Rhodes era/., 1990, J. Gen. Virol. 71:1965. Rhodes etaL, 1991, 
AIDS 5:145-151; Sezakiel et al., 1992, J. Virol. 66:5576-5581; Joshi et aL, 1991, J. Virol. 
65:5524-5530). 

The invention includes the use of decoy RNA to disiupt the SARS virus replication and 
5 life cycle. Methods of making and using such decoy RNA for treatment of a viral infection are 
known in the art The invention includes delivery of genes encoding, for exarr^le, the SARS 
virus intergenic sequence, to infected cells. Preferably, the sequence comprises one or more of 
tire sequences selected from the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ 
ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, 

10 SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. Preferably, the sequence 

conq)rises one or more of the sequences selected from the group consisting of SEQ ID NO: 7292 
and SEQ ID NO: 7293. Preferably, the sequence comprises SEQ ID NO: 7293. 

Li the present invention, delivery of intergenic sequence which is not linked to the SARS 
virus open reading frames disrupts the translation process of the viral RNA and decreases the 
15 production of vial proteins. Similar methods of treatment for HIV viral irrfection have been 

described. The foUowing references discuss the use of decoy RNA of HIV TAR or RRE for 
treatment of HIV infection. Each of these references is incorporated herein by referende in their 
entirety. (SuUenger et al., 1990, Cell 63:601-608; Sullenger et al., 1991, J. Virol. 65:6811-6816; 
Lisziewicz et al., 1993, New Biol. 3:82-89; Lee et al., 1994, J. Virol. 68:8254-8264), ribozymes 
20 (Sarver et al., 1990, Science 247:1222-1225; Wecrasinghe et al., 1991, J. Virol. 65:5531-5534; 

, Dropulic et al., 1992, J. Virol. 66:1432-1441; Ojwang et al., 1992, Proc. Natl. Acad. Sci. USA. 
89:10802-10806; Yu et al., 1993, Proc. Natl. Acad. Sci. USA. 90:6340-6344; Yu et oL, 1995, 
Proc. Natl. Acad. Sci. USA. 92:699-703; Yamada et al., 1994, Gene Therapy 1:38-45). 

The invention includes the use of the SARS virus intergenic sequence in diagnostic 
25 reagents, kits (conq>rising such reagents) and methods which can be used to diagnose or identify 

the presence or absence of a SARS virus in a biological sample. Such diagnostic reagents, kits, 
and methods are further discussed in Section II of the specification. 

The invention includes a pair of primers for amplifying a SARS polynucleotide sequence 
coursing (i) a first primer comprising a sequence which is substantially identical to a portion 
30 of a sequence selected from the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ 
ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, 
SEQ ID NO: 7299, SEQ ID NO: 7300 and S^ ID NO: 7301 and (ii) a second primer 
comprising a sequence which is substantially complementary to a portion of a sequence selected 
from the group consisting of the sequence SEQ ID NO: land the sequence SEQ ID NO: 2, such 
35 that the primer pair (i) and (ii) defines a tenq>late sequence within a sequence from the group 

consisting of the sequence SEQ ID NO: land the sequence SEQ ID NO: 2. Preferably, the (i) 
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first primpy comprises a sequCTce which is substantially identical to a portion of a sequence 
selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO; 7293. Preferably, the 
(i) first primer comprises a sequence which is substantially identical to a portion of the sequence 
of SEQ ID NO: 7293. The amplicon defined by said first and second primers is preferably 
5 between 50 and 250 nucleotides in length. Tlie primers may optionally be labeled to facilitate 
their detection. Methods and compositions for use in labeling primers are discussed further in 
the application in Section m. 

Tbe invention further includes a pair of primers for amplifying a SARS polynucleotide 
sequence comprising (i) a first primer com p ri sing a sequence which is substantially identical to a 
10 portion of the complement of a portion of a sequence selected fix)m the group consisting of SEQ 

ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, 
SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 
7301 and (ii) a second primer comprising a sequence which is substantially complementary to a ■ 
portion of the complement of a sequence selected from the group consisting of the sequence SEQ 
15 ID NO; land the sequence SEQ ID NO; 2, such that the primer pair defines a template sequence 
within a sequence selected- from the group consisting of the sequence SEQ ID NO: 1 and the 
sequence SEQ ID NO: 2. The amplicon defined by said first and second primers is preferably 
between 50 and 250 nucleotides in length. The primers may optionally be labeled to facilitate 
their detection.. Methods and compositions for use in labeling primers are discussed further in 
20 the application in Section m. ^ 

. The invention includes a kit comprising (i) a first primer comprising a sequence which is 
substantially idratical to a portion of a sequence selected from the group consisting of SEQ ED 
NO; 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ 
ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO; 7301 
25 and (ii) a second primer comprising a sequmce which is substantially complementary to a 

portion of a sequence selected from the group consisting of the sequence SEQ ID NO: 1 and the 
sequ^ce SEQ ID NO: 2, such that the primer pair (i) and (ii) defines a template sequence within 
a sequence from the group consisting of SEQ ID NO: 1 and SEQ ID NO: 2. Preferably, the (i) 
first primer comprises a sequence which is substantially identical to a portion of a sequence 
30 selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the 
(i) first primer comprises a sequence which is substantially identical to a portion of the sequence 
of SEQ ID NO: 7293. The primers may optionally be labeled to facilitate their detection. 
Methods and compositions for use in labeling primers are discussed further in the application in 
Section m. 

35 Other preferred kits comprise (i) a first primer comprising a sequence which is 

substantially identical to a portion of the complement of a portion of a sequence selected firom 
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the group co nsisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 
7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID 
NO: 7300 and SEQ ID NO: 7301 and (ii) a second primer conopcising a sequence which is 
substantially complementary to a portion of the conq)lement of a sequence selected from the 
5 group consisting of SEQ ID NO: 1 and SEQ ID NO: 2, such that the primer pair defines a 
template sequence within a sequence selected from the group consisting of SEQ ID NO: 1 and 
SEQ ID NO: 2. 

Tlie invention furdro: includes an attenuated S ARS virus for use as a vaccine wherein the 
intergenic region has been mutated to reduce expression of the viral structural or nonstructural 
10 proteins. The attenuated SARS virus may comprises one or more additions, deletions or 

insertion in one or more of the intergenic regions of the viral genome. Preferably, the attenuated 
SARS virus comprises an addition, deletion or insertion in one or more occurrences of the 
sequence selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. 
Preferably, the addition, deletion orinsertion occurs in one or more occurrences of SEQ ID NO: 
15 7293. 

The invention further comprises a small molecule which inhibits binding or association of 
the SARS viraJ replication machinery, such as a ribonucleoprotein, with the intergenic region of 
the viral genome. Preferably, the small molecule inhibits binding or association of the SARS 
viral machinery with a sequence selected fix»m the group consisting of SEQ ID NO: 7292 and 
20 SEQ ID NO: 7293. Preferably, the small molecule inhbiits binding or association of the SARS 
viral machinery with SEQ ID NO: 7293. Ibe invention further includes a mediod of screening 
for a -small molecule for treatmmit of SARS viral infection comprising using an assay to identify 
a small molecule which interferes with the association of the SARS viral replication machinery 
with the intergenic region of the SARS viral genome. 

25 The invention further provides a novel SARS polynucleotide sequence SEQ ID NO: 9968. 

AD six reading frames of this 690mer sequence are shown in Figure 113. The constituent amino 
acid sequences from Figure 113, having at least 4 amino acids, are listed as SEQ ID NOS: 9969 
to 10032. 

Accordingly the invention includes a polynucleotide sequence comprising SEQ ID NO: 

30 9968. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 9968. 

The degree of sequence identity is preferably greater than 50% (e.g., 60%, 70%, 80%, 85%, 

88%, 90%, 92%, 95%, 99% or more). 

Ihe invention includes an amino add sequence encoded by die polynucleotide sequence of 
SEQ ID NO: 9968, including the amino acid sequences selected fixim the group consisting of 
35 SEQ ID NO*: 9969 to 10032. Preferably, the amino acid sequence comprises SEQ ED NO: 9997 
or comprises SEQ ED NO: 9998. 
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The invention also provides amino acid sequences having sequence identity to an amino 
acid sequence encoded by SEQ ID NO: 9968. Tbe invoition provides amino acids having 
sequence identity to an amino add sequence selected from the group consisting of SEQ ID NO®: 
9969 to 10032. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 70%, 
5 80%, 85%, 88%, 90%, 92%, 95%, 99% or more). 

A portion of SEQ ID NO: 9968 matches with approximately 98% identity to a previously 
published SAKS polynucleotide sequrace, commonly referred to as ‘BNI-l” (SEQ ID NO: 
10033). BNI-1 was sequenced at Bernhard Nocht Institute for Tropical Medicine, National 
Reference Center for Tropical Infectious Diseases in EEamburg, G^many. TTie BNI-1 sequence 
10 was published on the WHO website on April 4, 2003 at http://www.who,mi/csr/sars/prinxers/en and in 
Dorsten et al., ‘Idratification of a Novel Coronavirus in Patients with Severe Acute Respiratory 
Syndrome”, New England Journal of Medicine, published online at http://mnv.nejm.org on April 
10,2003. Both references are incorporated herein by reference in their entirety. The six reading 
frames of this 302mer sequence are shown in Figure 114 (see also Figure 129). The constituent 
15 amino acid sequences from Hgure 1 14, having at least 4 amino acids, are listed as SEQ ID NO®: 

10034 to 10065. An alignment of SEQ ID NO: 10034 with SEQ ID NO: 9997 is shown in Figure 
130; 

The invention provides for polynucleotide sequences comprising fragments of SEQ ID 
NO: 9968. In one embodiment, the fragment does not consist entirely of SEQ ID NO: 10033 or 
20 of a known coronavirus. 

. .The invention provides for armno acid sequences comprising fragments of an amino acid 
sequence encoded by SEQ ID NO: 9968. In one embodiment, the fragment does not consist 
entirely of an amino acid sequence encoded by SEQ ID NO: 10033 or a known coronavirus. 

The invention provides for amino acids comprising fragments of an amino acid sequence 
25 selected from the group consisting of SEQ ID NO®: 9969 to 10032. In one embodiment, the 
fragment does not consist entirely of an amino acid sequence encoded by the polynucleotide 
sequence of SEQ ID NO: 10033 or a known coronavirus. 

Approximately 100 nucleotides at the 5’ end of SEQ ID NO: 9968 do not match any 
portion of the BNI-1 polynucleotide sequence (SEQ ID NO: 10033). This unmatched portion is 
30 set forth as SEQ ID NO: 10066. The invention thus furth^ provides a polynucleotide comprising 
the sequence comprising SEQ ID NO: 10066, polynucleotide sequences having sequence 
identity to SEQ ID NO: 10066, or polynucleotide sequences com prisin g fragments of SEQ ID 
NO: 10066. 

The invention further comprises an amino acid sequence encoded by SEQ ID NO: 10066, 

35 an amino acid sequence having sequence identity to an amin o acid sequence encoded by SEQ ID 
NO: 10066, or an amino acid sequence comprising fragmraits of an amin o acid sequence 
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encoded by SEQ ID NO: 10066. PrefMably, the amino acid sequence comprises SEQ ID NO: 
10067. 

SEQ ID NO: 9997/9998 demonstrates homology with the a region of pollab of several 
coronaviruses. HGtlRE 115 shows an alignment of SEQ ID NO*: 9997/9998 to amin o acid 
5 sequences for pollab of bovine coronavirus (SEQ ID NO: 10068), avian infectious bronchitis 
' virus (SEQ ID NO: 10069) and murine hepatitis virus (SEQ ID NO: 10070). A consensus amin o 
acid sequence of SEQ ID NO*: 9997/9998, SEQ ID NO: 10068, SEQ ID NO: 10069, and SEQ 
ID NO: 10070 is shown in the bottom row of the alignment in Figure 115 (e.g. SEQ ID NO: 
10071). 

10 As shown in FIGURE 113, the polynucleotide sequence encoding SEQ ID NO: 9997 has a 

stop codon after codon 205, between SEQ ID NO*: 9997 and 9998. Optionally, the stop codon 
can be removed and the amin o add sequence continued (SEQ ID NO: 10072). Accordingly, the 
invention provides for an amino acid sequence comprising SEQ ID NO: 9997 and/or SEQ ID 
NO: 9998, or SEQ ID NO: 10072, and further comprising an amino acid sequence encoding for 
15 the C-terminus of a coronavirus pollab gene or a fragment thereof. 

As shown in FIGURE 115, SEQ ID NO*: 10068, 10069, 10070 and 10071 contain amino 
acids prior to the N-terminus of SEQ ID NO: 9997. The invention also provides for an amin o 
add sequence comprising SEQ ID NO: 9997 and further comprising an amino add sequence 
encoding for the N-terminus of a coronavirus pollab protein or a fragment thaeof. 

20 The pollab sequences on FIGURE 115 contain a coding region indicated on the 

schematic of FIGURE 1 17 by a In FIGURE 1 15, the beginning of this genomic region is 

designated by the arrow crossing in front of amino add 6080 of the consensus sequence SEQ ID 
NO: 10071. The end of this genomic region is designated by the arrow crossing in front of 
amin o add 6604 of die consensus sequ^ce. The invention provides for an amino add sequence 
25 comprising SEQ ID NO: 9997 and/or SEQ ID NO: 9998, or SEQ ID NO: 10072, and further 

comprising a first amino sequence prior to the N-terminus of said SEQ ID NO: 9997 and/or SEQ^ 
ID NO: 9998, or SEQ ID NO: 10072, wherein said first amino add sequence has homology to an' 
N-terminus sequence of a known coronaviras pollab protein or a fi'agment thereof. 

The invention further provides for an amino add sequence comprising SEQ ID NO: 9997 
30 and SEQ ID NO: 9998, wherein the stop codon after SEQ ID NO: 9971 is removed (ie. SEQ ID 
NO: 10072), and further comprising a second amino add sequence following the C tenminus of 
SEQ ID NO: 9998, wherein said second amino acid sequence is homologous with a C terminus 
of a known coronavirus pollab protein or a fragment thereof. 

Examples of such protdns are shown aligned in FIGURE 118, and are SEQ ID NO*: 

35 10073 to 10077. SEQ ID NO: 10073 comprises SEQ ID NO: 9997 and further comprises amino 
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adds prior to the N-teiminus and subsequent to the C-terminus from the pol lab protein of 
avian infectious bronchitis virus. SEQ ID NO: 10074 comprises SEQ ID NO: 9997 and further 
comprises amino adds prior to the N-terminus and subsequent to the C-terminus from the 
pollab protein of bovine coronavirus. SEQ ID NO: 10075 comprises SEQ ID NO: 9997 

5 and further comprises amino acids prior to the N-terminus and subsequent to the C-terminus 

from the pollab protein of murine hepatitis virus. SEQ ID NO: 10076 comprises SEQ ID 
NO: 9997 and further comprises amino adds prior to the N-terminus and subsequent to the 
C-terminus from the consensus of the pollab protein of avian infectious bronchitis virus, 
bovine coronavirus, and murine hepatitis virus (FIGURE 115). SEQ ID NO: 10077 comprises 
10 the consensus sequence of SEQ ID NOS: 10073 to 10076. 

The invention comprises an amino add sequence selected from the group consisting of 
SEQ ID NO^: 10073, 10074, 10075, 10076 and 10077. The invention further includes an aminn 
acid sequence comprising fragments of an amino acid sequence selected from the group 
consisting of SEQ ID NO®: 10073, -10074, 10075, 10076 and 10077. The invention further 
15 comprises an amino acid sequence with sequence identity to a sequence selected from the group 
consisting of SEQ ID NO®: 10073, 10074, 10075, 10076 and. 10077. 

The invention comprises polynucleotides encoding for the amino acid sequences selected 
from the group consisting of SEQ ID NO®: 10073, 10074, 10075, 10076 and 10077. The 
invention comprises polynucleotides having sequence identity to polynucleotides encoding for 
20 the amino acid sequraices selected from the group consisting of SEQ ID NO®: 10073, 10074, 
10075, 10076 and 10077. Hie invention comprises fragments of polynucleotides encoding SEQ 
ID NO®: 10073, 10074, 10075, 10076 and 10077. 

As shown in Figure 113, SEQ ID NO: 9968 includes a sequence that encodes SEQ ID NO: 
10020 foDowed by a stop codon, giving a C-terminus threonine (Thr) residue. The corresponding 
25 sequence from an amino acid sequence encoded by BNI-1 is SEQ ID NO: 10078, which 
continues past the C-terminus of SEQ ID NO: 10020. Accordingly, the invention incl ud es a 
protein con^irising amino acid sequence SEQ ID NO: 10020 or an amino acid sequence having 
sequence identity to SEQ ID NO: 10020 or an amino acid sequence comprising a fragment of 
SEQ ID NO: 10020, wherein the C-terminus residue of said protein is a threonine. Preferably, 

30 the C-terminus of said protein is -ST. StUl more preferably, the C-terminus of said protein is - 
EST. The invaition also includes a protein comprising amino acid sequence SEQ ID NO: 10078 
or an amino acid sequence having sequmice identity to SEQ ID NO: 10078 or an amino acid 
sequence comprising a fragment of SEQ ID NO: 10078, wherein the C-terminus residue of said 
protein is Thr. Preferably, the C-terminus of said protein is -ST. Still more preferably, the C- 
35 terminus of said protein is -EST. 
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SEQ ED NO: 9968 also encodes a-54mer amin o acid sequence SEQ ID NO: 10015. The 
polynucleotide encoding SEQ ID NO: 10015 encodes two stop codons at its C-terminus (Figure 
1 13). The conesponding region from die BNI-1 sequence does not contain this 54mra-. 
Accordingly, the invention includes a protein comprising amino acid sequence SEQ ID NO: 

5 10015, or an amino add sequence having sequence identity to SEQ ID NO: 10015 or an amin o 

add sequence comprising a firagment of SEQ ID NO: 10015. The invention further includes a 
polypeptide comprising SEQ ID NO: 10015 and further comprising a first amino acid sequence 
prior to the N-terminus of SEQ ID NO: 10015. 

SEQ ID NO: 9968 encodes the amin o add sequence SEQ ID NO: 9969. The 
10 polynucleotide sequ^ce contains a stop codon at the C-terminus of SEQ ID NO: 9969. 

Accordingly, the invention includes a protein comprising amino add sequence SEQ ED NO: 
9969, or an amino add sequence having sequence identity to SEQ ID NO: 9969. The invention 
further includes a polypeptide comprising SEQ ED NO: 9969 and further comprising a first 
amino add sequence prior to the N-terminus of SEQ ID NO: 9969. The invention further 
15 includes a pplypeptide comprising the sequence SEQ ID NO: 10079. 

SEQ ED NO: 9968 encodes amino acid sequence QRT (Figure 1 13), followed by a stop, 
codon. Accordingly, the invention includes a protein comprising amino acid sequence QRT. The 
invention further includes a polypeptide comprising amino acid sequence QRT and further 
comprising a first amino add sequence prior to the N-terminus of the sequence QRT. 

20 . SEQ ID NO: 9968 encodes amino acid sequence SEQ ED NO: 10022, followed by a stop 

codon at its C-terminus. Accordingly, &e invention includes a protein comprising amin o add 
sequence SEQ ID NO: 10022, or an amino acid sequence having sequence identity to SEQ ID 
NO: 10022. The invention further includes a polypeptide comprising SEQ ID NO: 10022 and 
further comprising a first amino add sequmce prior to the N-terminus of SEQ ED NO: 10022. 

25 SEQ ID NO: 9968 encodes amin o acid sequence SEQ ED NO: 10027. Within the SEQ ID 

NO: 10027 coding sequence there are at least three start codons, identified with underlining in 
Hgure 119. The open reading frame indicated by the first start codon is SEQ ID NO: 10081. The 
open reading frame indicated by the second start codon is SEQ ID NO: 10082. The open reading 
fiame indicated by the third start codon is SEQ ID NO: 10083. 

30 The invention provides a novel SARS polynucleotide sequence SEQ ID NO: 10084. All 

six reading frames of this 1463mer sequence are shown in Figure 120 (see also Figure 122). The 
constituent amino acid sequences from Kgure 120, having at least 4 amino acids, are listed as 
SEQ ID NOS: 10085 to 10209 (see Figures 120A to 120F). 

The invention includes a polynucleotide sequence comprising SEQ ID NO: 10084. The 
35 invention also provides polynucleotide sequences having sequence identity to SEQ ED NO: 

10084. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
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ID NO: 10084. In one embodiinent, the polynucleotide fragment does not consist entirely of 
SEQ ID NO: 10033 or a known coronavirus polynucleotide sequence or a known SARS 
polynucleotide sequence. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
5 SEQ ID NO: 10084, including the amino acid sequences of Figures 120A to 120F e.g. selected 

c 

from the group consisting of SEQ ID NO : 10085 to 10209. Preferably, the amino acid sequence 
comprises SEQ ID NO: 10149. 

The invention also provides amino acid sequences having sequence identity to an amin o 
acid sequence encoded by SEQ ID NO: 10084. The invention provides amin o acids having 
10 sequence identity to an amino add sequence from Hgures 120A to 120F e.g. selected from the 

group consisting of SEQ ID NO*: 10085 to 10209.' 

The invention also provides fragments of amino acid sequences encoded by SEQ ID NO: 
10084. The invention also provides fragments of amino acid sequences selected from die group 
consisting of SEQ ID NO*: 10085 to 10209. In one embodiment, the firagment does not consist 
15 entirely of an amin o add sequence encoded by SEQ ID NO: 10033 or an amino acid sequence of 

a known coronavirus or an amino add sequence of a known SARS virus. An alignment of the 
matching portion of SEQ ID NO: 10033 and SEQ ID NO: 10084 is included in FIGURE 121. 

. . In one embodiment, the invention comprises an amino acid sequraice comprising SEQ ID 
NO: 10149. An alignment of the polynucleotide sequence SEQ ID NO: 10084 to the encoded 
20 SEQ ID NO: 10149 is shown in FIGURE 122 (5'3' Frame 3). Analysis of the 5'3' Frame 3 

translation by, a computer program to predict s^ codon methionines (NetStart 1.0) ^GURE 
123) reveals SEQ ID NO*: 10210 to 10215. 

The invention includes a protein comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ ID NO: . 
25 10213, SEQ ID NO: 10214 and SEQ ID NO: 10215. The invention includes a protdn having 

sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 
10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ ID NO: 10213, SEQ ID NO: 10214 and 
SEQ ID NO: 10215. In one embodiment, the protein does not consist entirely of an amino acid 
sequence of a known SARS virus or of a known coronavirus. 

30 The invention includes a fragment of a protein comprising an amino acid sequence selected 

from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ 
ID NO: 10213, SEQ ID NO: 10214 and SEQ ID NO: 10215. In one embodiment, the fragment 
does not consist entirely of an amino acid sequence of a known SARS virus or of a known 
coronavirus. 

35 In one embodiment, the invention includes a polypeptide comprising the amino acid 

sequence selected from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211 and 
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SEQ ID NO: 10212. Partial results of a BLAST of SEQ ID NO: 10210 against GenBank is 
included in HGURE 124. These results indicate that SEQ ID NOS: 10210, 1021 1 and 10212 
have functional similarities to a Coronavirus RNA polymerase, particularly the RNA polymerase 
of murine hepatitis virus, bovine coronavirus, and avian infectious bronchitis. 

5 In one embodiment, the invention is directed to a polypeptide comprising a first amino add 

sequ^ce selected fi»m the group consisting of SEQ ID NO: 10210, SEQ ID NO: 1021 1 and 
SEQ ID NO: 10212 and a second amino add sequence from the C-terminus of a coronavirus 
ORFlab sequence. Preferably, the second amino add sequence is fix>m a bovine coronavirus. 
One example of this embodiment is shown below as SEQ ID NO: 10216. Amin o adds 1-481 of 
10 SEQ ID NO: 10216 are the first amino acid sequence of SEQ ID NO: 10210, and amin o adds 
482-1 152 are the second amin o add sequence of the C-terminus of a bovine coronavirus 
orf lab polyprotein (Gi 26008080) (NP_150073.2) (SEQ ID NO: 10217). 

■ Accordingly, the invention includes a polypeptide comprising SEQ ID NO: 10216. The 
invention further includes a polypeptide comprising a first amino add sequence of SEQ ID NO: 
15 10210 and a second amin o add sequence of SEQ ID NO: 10217. The invention further includes- 

a polypeptide comprising a first amino acid sequence having greater than identity to SEQ ID 
NO: 10210 and a second amino add sequence having greater than y% identity to SEQ ID NO: 
10217, wherein x is greater than or equal to 85% (e.g., 86%, 87%, 88%, 89%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99% or more) and wherein y is greater than or equal to 60% 

20 (g.g., 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more). 

The invention also includes a polypeptide comprising a fragment of SEQ ID NO: 10210, 
wherein said fragment includes an epitope. Computer-predicted epitopes of SEQ ID NO: 10210, 
using a 17mer window, are included in FIGURE 125A (Hopp & Woods) and FIGURE 125B 
(Kyte & Doolittle). 

25 The amino add sequence of SEQ ID NO: 10210 also contains two predicted glycosylation 

sites at amino adds 81-84 (NNTE; SEQ ID NO: 10218) and at 180-183 (NHSV; SEQ ID NO: 
10219). Accordingly, the' invention includes a polypeptide conq)rising a fragment of SEQ ID 
NO: 10210, wherein said fragment includes a glycosylation site. The invention further includes 
a polypeptide comprising a fragment of SEQ ID NO: 10210, wherein said fragment includes the 
30 Asn at position 81. Preferably, said Asn is glycosylated. The invention further includes a 

polypeptide comprising a fragment of SEQ ID NO: 10210, whaein said fiugment includes the 
Asn at position 180. Preferably, said Asn is glycosylated. 

In one embodiment, the invention includes a polypeptide comprising an amino add 
sequence finm within Rgure 120D and/or SEQ ID NO^: 10150 to 10160 e.g. from SEQ ID NO^: 
35 10154, 10155, 10158 and 10160. Within SEQ ID NO: 10154 the following amino acid sequences 

starting with a Met and ending at a stop codon can be identified: SEQ ID NO^: 10220 to 10227. 
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Accordingly, the invention includes a polypepticte comprising an amino acid sequence 
selected ficom the group consisting of SEQ ID NO: 10220, SEQ ID NO: 10221, SEQ ID NO: 
10222, SEQ ID NO: 10223, SEQ ID NO: 10224, SEQ ID NO: 10225, SEQ ID NO: 10226 and 
SEQ ID NO: 10227, or a fiagment thereof or an amino acid sequence having sequence identity 
5 thereto. 



10 



15 



20 



25 



In one embodiment, the invention includes a polypeptide comprising the amin o acid 
sequence within Figure 120E e.g. from SEQ ID NO®: 10161 to 10182, and in particular SEQ ID 
NOS: 10171 and 10176. Within SEQ ID NO®: 10171 and 10176 the following amino acid 
sequences starting with a Met and ending at a stop codon can be identified: SEQ ID NO: 10228 
and SEQ ID NO: 10229. 

Accordingly, the invention includes a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 10228 and SEQ ID NO: 10229, or a firagment 
thereof or an amino add sequence having sequence identity thereto. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
sequence from Figure 120F e.g. SEQ ID NO®: 10183 to 10209. Within Figure 120F the 
following amino add sequence starting with a Met and ^ding at a stop codon can be identified- 
SEQ ID NO: 10187. Accordingly, the invention includes a polypeptide comprising an amino 
add sequence of SEQ ID NO: 10187, or a fragment thereof or an amin o acid sequence having 
sequence identity thereto. 

Li one embodiment, the polynucleotides of the invention do not include one of the 



following primers, disclosed at http://content.nejnuorg/cgi/reprint/NEJMoa03078lv2ipdf: 



5 ' GGGTTGGGACTATCCTAAGTGTGA3 ’ 
5 ' TAACaav,aVACICCATCATCA3 ' 

5 ' CTAACATGCTTAGGATAAT(3G3 • 

5 ’ GCCTCTCTTGTTCTTGCTCGC3 • 

5 ’ CAGGTAAGCGTAAAACTCATC3 * 



(SEQ ID NO: 10230) 
(SEQ ID NO: 10231) 
(SEQ ID NO: 10232) 
(SEQ ID NO: 10233) 
(SEQ ID NO: 10234) 



The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
30 or identify the presence or absence of a S ARS virus in a biological sample. The invention 

includes the polynucleotide primers identified in Table 31 (SEQ ID NO®: 10235 to 10258), the 
forward primers SEQ ID NO®: 10259 to 10281 and the reverse primers SEQ ID NO®: 10282 to 
10298. The invention further includes polynucleotide sequences which are complementary to 
any one of these primer sequences disclosed herein. 

35 The invention provides a SARS polynucleotide sequence SEQ ID NO: 10299. AH six 

reading frames of this sequence are included in FIGURE 126 (See also Figure 131). TTie 
constituent amino acid sequences from Hgure 126, having at least 4 amino acids, are listed as 
SEQ ID NOS: 10300 to 10337. 
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Accordingly, the invention includes a polynucleotide ^uence comprising SEQ ID NO: 
10299. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
10299. Hie invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 10299. In one embodiment, the polynucleotide fragmrat does not consist entirely of a 
5 known polynucleotide sequence of a SARS virus or a known polynucleotide sequence of a 

coronavirus. 

The invention includes an amin o acid sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 10299, including the amino acid sequences shown in Hgure 126, and the amin o 
acid sequences selected from the group consisting of SEQ ID NO^: 10300 to 10337. Preferably, 
10 the amino acid sequence comprises SEQ ID NO: 10316. 

The invention also provides amin o acid sequences having sequence identity to an amino 
acid sequence encoded by SEQ ID NO: 10299. The invention provides amino acid sequences 
having identity to an amino acid sequence selected from the group consisting of SEQ ID NO^: 
10300 to 10337. 

15 ■ The invention also provides firagments of amino acid sequences encoded by SEQ ID NO: 

10299. The invention also provides fragments of amin o acid sequences selected from the group 
consisting of SEQ ID NO*: 10300 to 10337. In one embodiment, the fragment does not consist 
entirely of a known amino acid sequence of a SARS virus or a known amino acid sequence of a 
coronavirus. 

20 : In one embodiment, the invention comprises an amino acid sequence comprising SEQ ID > 

NQ: 10316. i&icoded open reading frames within SEQ ID NO: 10316 include SEQ ID NO: 
10338 and SEQ ID NO: 10339. 

In one embodiment, the invention comprises an amino acid sequence comprising a 
sequence firom within the 53' Frame 1 translation of SEQ ID NO: 10299. The following encoded 
25 open reading frame is found within this translation: SEQ ID NO: 10340. 

hi one embodiment, the invention conqirises an amino acid sequence comprising a 
sequence from within the 3'5* Rame 1 translation of SEQ ID NO: 10299. An encoded open 
reading frame within this translation is SEQ ID NO: 10341. 

In one embodiment, the invention comprises an amino acid sequratce comprising a 
30 sequence from within the 3'5’ Frame 2 translation of SEQ ID NO: 10299. An encoded open 

reading frame within this translation is SEQ ID NO: 10342. 

The invention includes a polypeptide comprising an amin o acid sequence selected from the 
group consisting of SEQ ID NO: 10338, SEQ ID NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 
10341 and SEQ ID NO: 10342. The invention includes a polypeptide having sequence identity 
35 to an amino acid sequence selected from the group consisting of SEQ ID NO: 10338, SEQ ID 
NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 10341 and SEQ ID NO: 10342. The invention 
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15 



includes a fragment of a polypeptide comprising an amino add sequence elected &om the group 
consisting of SEQ ID NO: 10338, SEQ ID NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 10341 
and SEQ ID NO: 10342. In one embodiment, the fragment does not consist entirely of a known 
SARS vims amino add sequence or of a known coronavirus amino add sequence. 

In one embodiment, SEQ ID NOS: 10338-10342 are used in fusion proteins. Accordingly, 
the start codon methionines may be removed. The invention comprises a amino add sequence 
selected from the group consisting of SEQ ID NO: 10343, SEQ ID NO: 10344, SEQ ID NO: 
10345, SEQ ID NO: 10346 and SEQ ID NO: 10347. 



Li one embodiment, the invention comprises an amino acid sequence selected ftom the 
group consisting of SEQ ID NO: 10338 and SEQ ID NO: 10339. Partial BLAST results of SEQ 
ID NO: 10338 against GenBank are given below: 



ot Q7Q1A? RNA-DIRECTED RNA POLimERASE (ORFlB) 

gi|93934|pir||A43489 RNA-directed RNA polymerase (EC 2.7.7.48) - porcine 
transmissible gastroenteritis virus (fragment) * ^ 

^,MB33161|e»b|CM37284.1| polywrase g.stroantarltis 



Length = 533 



:o 



5 



0 



3 



Score = 131 bits (329), Expect = 3e-30 

Identities = 55/89 (61%), Positives = 69/89 (77%), Gaps = 1/89 (1%) 



Query: 

Sbjct: 



1 . J^jrcKDGHVETFyPKLQASQAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMN 

MLWC++ H++TFYP+LQ+++ W PG +MP LYK+QRM LE+C+L NYG +P GI N 

MLWCENSHIKTFYPQLQSAE-WNPGYSMPTLYKIQRMCLERCNLYNYGAQVKLPDGITTN 



60 

275 



Query: 

Sbjct: 



61 VAKYTQLCQYIJmjTLAVPSNMRVIHFGA 89 
V KYTQLCQYLNT TL VP MRV+H GA 
276 VVKYTQLCQ3nCJmTl,CVPHKMRVI.HIX3A 304 



These results indicate that SEQ ID NO: 10338 has functional sirmlarities to an RNA- 
directed RNA polymerase of porcine transmissible gastroenteritis vims. 

Partial BLAST results of SEQ ID NO: 10339 against GenBank are given below: 

>gb|AAL57305.1 I replicase (bovine coronavirus] 

Length = 7094 



Score = 139 bits (351), Es^ct = 7e-33 

Identities = 64/108 (59%) , Positives = 78/108 (72%) 



Query: 1 ^SKVVKVTIDYAEISFMLWCKDGHVETFYPKLQASQAWOPGVAMPMI.YKMQ Wmtt.i:^ 60 

^ “ +SKW V +EH- + FMLWC D V TFYP+LQA+ W+PG +MP LYK +E+ 

6819*^' ^^*®^®^'''''**VNVDPKDPQPMLWaroEKVm-FYPRLQAASDWKPGYSMPVLY^ 



Query: 61 CDLQNYGENAVIPKGIMMNVAKYTQLCQYIiimjTLAVPSNMRVIHFGA 108 

u ^ ® MMNVAKYTQLCQYLNT TLAVP NMRV+H GA 

SbDct: 6820 VSLWNYGKPVTLPTGCMMNV 2 UCYIQLCQYLimTLAVPVl»mVU 6867 

These results indicate that SEQ ID NO: 10339 has functional similarities to a repUcase of 
bovine coronavirus. 
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The SARS virus may contain polymorphism at the Glu-20 residue of SEQ ID NO: 10338. 
The invention includes a polypeptide comprising an amino add sequence having sequence 
identity to SEQ ID NO: 10338, wherein said polypeptide includes an amino acid sequence 
selected from the group consisting of ASQAW (SEQ ID NO: 10348) and ASRAW (SEQ ID NO: 

5 10349). Hie invention includes a fragment of a polypeptide comprising SEQ ID NO: 10338, 

wherein said fragment includes an amino add sequence selected from the group consisting of 
SEQ ID NO: 10348 and SEQ ID NO: 10349. 

Hie SARS virus may contain polymorphism at the Ser-80 residue of SEQ ID NO: 10338. 
below. The invention includes a polypeptide comprising an amin o acid sequence having 
10 sequence identity to SEQ ID NO: 10338, wherein said polypeptide includes an amino add 

sequence selected from the group consisting of VPSNM (SEQ ID NO: 10350) and VPTNM (SEQ 
ID NO: 10351). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
10338, wherein said fragment includes an amino acid sequence selected from the group 
consisting of S^ ID NO: 10350 and SEQ ID NO: 10351. 

15 The invention also includes polynucleotide sequences which can be used as probes for 

diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
Table 32. The invention further includes polynucleotide sequence comprising the complement of 
20 one or more of the primer sequences identified in Table 32. 

The invention provides a SARS polynucleotide sequence SEQ ID NO: 10505. All six 
reading frames of this se^ence are shown in Figure 127 (see also Figure 132). The constituent 
amino add sequences from Rgure 127, having at least 4 amino acids, are listed as SEQ ID NOS: 
10506 to 10570. 

25 Hie invention includes a polynucleotide sequence comprising SEQ ID NO: 10505. The 

invention also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
10505. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 10505. In one embodiment, the polynucleotide fragment does not consist entirely of a 
known SARS virus polynucleotide sequence or of a known coronavirus polynucleotide 
30 sequence. 

The invention includes an amino add sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 10505, including the amino add sequences shown in Figure 127, and particularly 
those selected from the group consisting of SEQ ID NO®: 10506 to 10570. Preferably, the amino 
add sequence comprises SEQ ID NO: 10532 and/or SEQ ID NO: 10533. 

35 The invention also provides amino add sequences having sequence identity to an amin o 

add sequence encoded by SEQ ID NO: 10505. The invention provides amino add sequences 
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having sequence identity to an amino acid sequence selected ftom the group consisting of the 
sequences shown in Kgure 127, and in particular SEQ ID NO®: 10506 to 10570. 

The invention also provides fragments of amino acid sequences encoded by SEQ ID NO: 
10505. The invention also provides fragments of amino acid sequences selected from the group 
5 consisting of SEQ ID NO®: 10506 to 10570. hi one embodiment, the fragment does not consist 
entirely of a kuown amino add sequence of a SARS virus or a known amino add sequence of a 
coronavirus. 



10 



5 



0 



In one embodiment, the invention includes a polypeptide composing an amino acid 



sequence from the 33' Ptame 3 of Hgme 127. Some encoded open reading fiames within this 
translation are: SEQ ID NO: 10533; SBQ ID NO: 10571; SEQ ID NO: 10572; SEQ ID NO- 
10573; SEQ ID NO: 10574. 

The invention includes a polypeptide sequence comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 10533. SEQ ID NO: 10571, SEQ ID NO: 
10572, SEQ ID NO: 10573 and SEQ ID NO: 10574. The invention includes a polypeptide 
having sequence identity to the amino acid sequence selected from the group consisting of SEQ 
ID NO: 10533, SEQ ID NO: 10571, SEQ ID NO: 10572, SEQ ID NO: 10573 and SEQ ID NO: 
10574. The invention includes a fragment of a polypeptide sequence comprising an amino acid 



sequence selected from the group consisting of SEQ ID NO: 10533. SEQ ID NO: 10571, SEQ 

ID NO: 10572. SEQ ID NO: 10573 and SEQ ID NO: 10574. 

Partial BLAST results of SEQ ID NO: 10533 against GenBank are given below 

"-I-cpsid p„t,in 

Length =451 



5 



) 



I 



Score = 147 bits (370), Expect = 3e-34 

Identities = 102/252 (40%), Positives = 137/252 (54%), Gaps = 18/252 (7%) 



Query: 49 
Sbjct: 63 



-fTQ K -PE +F +GQGVPI + +q gy-i- R RR + D6+ k+L PRW 

SWFSGITQFQKGKEPQPAQGQGVPlASGIPASEQKGyvmUINRRSFKTPI^^ 122 



Sbjct: 123 TfFYYLGTCPHAGAEYGDDIEGVVWVASCXJADTKTTADWERDPSSHEMP^ 182 



Query: 167 ^FJJA^SRGGSQMSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALIJ^ 226 

Ttiiri + AS S N SS PA 

Sbjct: 183 QGFYVEGSGRSAPASRSGSRSQSRGPNNRARSSSNQRQPASAVKPDMAEEIAALVIAKLG 242 



Query: 

Sbjct: 



227 QLESKVSGK^QQQGQTVTKKSAABASK KPRQKRTATKQYNVTQAFGRRGPEQTQG 282 

0/.7. w VTK+SA E + KPRQKRT KQ V Q PG+RGP 0 

243 K--- DAGQPKQ ^VTKQSAKBVRQKILTKPRQKRTPNKQCPVQQCPGKRGPNQ 290 



Query: 283 NFGDQDLIRQGT 294 
NFG ++++ GT 

Sbjct: 291 NFGGSEMLKLGT 302 
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20 



25 



30 



35 



«) 



15 



)0 



)5 



iO 



1 3132999 1 gb I AAC16422 . 1 1 nucleocapsid pirobein [muxine hepafcifcxs virus 

strain 2] 

Length = 451 

Score = 147 bits (370), Expect = 3e-34 

Identities = 102/252 (40%), Positives = 137/252 (54%), Gaps = 18/252 (7%) 

Query: 49 SWPTALTQHGK-EELRPPRG(3GVPINTNSGPDDQ1GYYRRATRR-VRGGD6KMKELSPRW 106 
SWP+ K +E +F +GQGVPI + +Q GY+ R RR + DG+ K+L PRW 

Sbjct: 63 SWFSGITQFQKGKEFQFAQGQGVPIASGIPASEQKGYWYRHNRRSFKTPDGQHKQLLPRW 122 

Query: 107 YFYYLGTGPEASLPYGANKEGIVWVATEGftLIJTPKDHIGTRNPNNNAATVLQLPQGTTLP 166 
YFYYI/3TGP A YG + EG+VWVA++ A + R+P+++ A + GT LP 

Sbjct: 123 YFYYIiGTGPHAGABYGDDIEGWWVASQQADTKTTADWHlDPSSHEAIPTKFAPGTVLP 182 

Query: 167 KGFYAEGSRGGSQASSRSSSRSRGNS^STPGSSRGNSPARMASGGGETALALLLLDRLN 226 
+GFY EGS + AS S N SS PA +A L+L +L 

Sbjct: 183 QGFYVEGSGKSAPASRSGSRSQSRGPNNRARSSSNQRQPASAVKPDMAEEIAALVLAKLG 242 

Query: 227 QLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQG 282 

+ GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 

Sbjct: 243 K DAGQPKQ VTKQSAKEVRQKILTKPRQKRTPNKQCPVQQCFGKRGPNQ 290 

Query: 283 NFGDQDLIRQGT 294 
NFG ++++ GT 

Sbjct: 291 NFGGSEMLKL(3T 302 



>gi|127877|sp|P03417 |nCAP_CVMJH Nucleocapsid protein 
gij74859|pirj IVHIHMJ nucleocapsid protein - murine hepatitis virus 

(strain JHM) 

gi|58973|emb|CAA25497.l| nucleocapsid protein [Murine hepatitis virus] 
Length = 455 

Score = 146 bits (369), Expect = 4e-34 

Identities = 110/254 (43%), Positives = 142/254 (55%), Gaps = 22/254 (8%) 

Query: 49 SWFTALTQHGK-EEIjRFPRG(JGVPINTNSGPDDQIGYYRRATRR-VRGGDGKMKELSPRW 106 
SWF+ +TQ K +E +F +GQGVPI Q GY+ R RR + DG+ K+L PRW 

Sbjct: 67 SWFSGITQFQKGKEFQFAQGQGVPIANGIPASQQKGYWYRHNRRSFKTPDGQQKQLLPRW 126 

Query: 107 YFYYLGTGPEASLPYGANKEGIVWVATE(3ALNTPKDHIGTRNPNNNAATVLQLPQGTTLP 166 
YFYYLGTGP A YG + EG+VWVA++ A I R+P+++ A + GT LP 

Sbjct: 127 YFYYIiGTGPYAGAEYGDDIEGWWVASQQAETRTSADIVERDPSSHEAIPTRFAPCTVLP 186 

Query: 167 KGFYAEGSRGGSQASSRSSSR—SRffllSRNSTPGSSRGNSPARMASGGGErALALLLLDR 224 
+GFY EGS G S +SRS SR SRG N SS PA +A L+L + 

Sbjct: 187 QGFYVEGS-GRSAPASRSGSRPQSRG-PNNRARSSSNQRQPASTVKPDMAEEIAALVLAK 244 

Query: 225 LNQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQT 280 

L + GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 

Sbjct: 245 LGK DAGQPKQ VTKQSAKEVRQKILNKPRQKRTPNKQCPVQQCFGKRGPNQ- 294 

Query: 281 Q(3NFGDQDLIRQGT 294 
NFG ++++ GT 

Sbjct: 295 — NFGGPEMLKLGT 306 
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>gi 1 6625766 1 gb| AAF19389 . 1 1 AF201929_7 nucleocapsid protein [murine 

hepatitis virus strain 2] 

gi 1 7769348 1 gb| AAF69338 . 1 1 AF208066_11 nucleocapsid protein [murine 

hepatitis virus] 

Length = 451 



Score = 146 bits (368), Expect = 5e-34 

Identities = 102/252 (40%), Positives = 137/252 (54%), Gaps = 18/252 (7%) 



Query: 


49 


Sbjct : 


63 


Query: 


107 


Sbjct: 


123 


Query: 


167 


Sbjct: 


183 


Query: 


227 


Sbjct: 


243 


Query: 


283 


Sbjct : 


291 



SWFTALTQHGK-EELRFPRGQGVPIimTSGPDDQIGYYRRATRR-VRGGDGKMKELSPRW 
SWF+ +TQ K +E +F +GQGVPI + +Q GY+ R RR + DG+ K+L PRW 

SWFSGITQFQKGKEFQFAQGQGVPIASGIPASEQKGYWYRHNRRSFKTPDGQHKQLLPRW 

YFYYLGTGPEASLPYGANKEGIVWVATEGAIiNTPKDHIGTRNPNNNAATVIjQLPQGTTLP 
YFYYLCTGP A YG + EG+VWVA++ A + R+P+++ A + GT LP 

YFYYLGTGPHAGAEYGDDIEGWWVASQQADTKTTADWERDPSSHEAIPTRFAPGTVLP 

KGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALLLLDRLN 
+GFY EGS + AS S N SS PA +A L+L +L 

QGFYVEGSGRST^ASRSGSRSQSRGPNNRARSSSNQRQPASAVKPDMAEEIAALVLAKLG 

QLESK7SGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQG 

+ GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 

K DAGQPKQ VTKQSAKEVRQKILTKPRQKRTPNKQCPVQQCFGKRGPNQ- — 

NFGDQDLIRQGT 294 
NFG ++++ GT 
NFGGSEMLKLGT 302 



106 

122 

166 

182 

226 

242 

282 

290- 



35 



40 



45 



50 



55 



60 



>gi 1 21734854 | gb | AAM77005 . 1 |AF481863_7 phosphorylated nucleocapsid • protein 
N [porcine hemagglutinating encephaloinyelitis virus] 

Length =449 

Score = 145 bits (366), Expect = 8e-34 

Identities = 107/253 (42%), Positives = 145/253 (57%), Gaps = 18/253 (7%) ‘ 



Query: 


49 


Sbjct: 


64 


Query: 


107 


Sbjct: 


124 


Query: 


166 


Sbjct: 


183 


Query: 


226 


Sbjct : 


237 


Query: 


282 


Sbjct: 


294 



SWFTALTQH<^--EELRFPRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDGKMKELSPRW 
SWF+ +TQ K +E F GQGVPI + GY+ R RR + DG ++L PRW 
SWFSGITQFQKGKEFEFAEGQGVPIAPGVPATEAKGYWYRHNRRSPKTADGNQRQLLPRW 

YFYYLGTGPEASLPYGANKEGIWVATE6A-LNTPKDHIGTROTNNNAATVLQLPQGTTL 
YFYYLGTGP A YG + +G+ WVA+ A +NTP D I R+P+++ A + P GT L 
YFYYLGTGPHAKHQYGTDIDGVFWVASNQADINTPAD- IVDRDPSSDEAIPTRFPPGTVL 



PKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALLLLDRL 
P+G+Y EGS G S +SRS+SR+ N S SR NS R ++ G +A D++ 
PQC3YYIEGS -GRSAPNSRSTSRA- PNRAPSAGSRSRANSGNRTSTPGVTPDMA DQI 

NQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQ 

L GK + Q VTK++A E + KPRQKR+ KQ V Q FG+RGP Q 
ASLVLAKLGK-DATKPQQVTKQTAKEVRQKILNKPRQKRSPNKQCTVQQCFGKRGPNQ— 



GNFGDQDLIRQGT 294 
NFG ++++ GT . 
-NFGGGEMLKLGT 305 



106 

123 

165 

182 

225 

236 

281 

293 
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>gi|23295765 |gb|AAIj80036.1| nucleocapsid protein [porcine 

hemagglutinating eucephaloinyelitis vimis] 

Iiength = 449 

Score = 145 bits (365) , Expect = le-33 

Identities = 107/253 (42%), Positives = 145/253 (57%), Gaps = 18/253 (7%) 

(Juery: 49 SWFTALTQHGK-BELRPPRGQGVPINTNSGPDDQrGYYRRATRR-VRGGDGKMKELSPRW 106 
SWP+ +TQ K +E P GQGVPI + GY+ R RR + DG ++L PRW 

Sbjct: 64 SWFSGITQPQKGKEPEPAEGQGVPIAPGVPSTEAKGYWYRHNRRSFKTADGNQRQLIiPRW 123 

Query: 107 YFYYLGTGPEASLPYGANKEGIVWVATEGA-IiNTPKDHIGTRNPNNNAATVIiQIiPQGrrD 165 
YFYYLGTGP A YG + +G+ WVA+ A +NTP D I R+P+++ A + P GT L 
Sbjct: 124 YFYYIjGTGPHAKDQYGTDIDGVFWVASNQADINTPAD-IVDRDPSSDEAIPTRFPPGTVIi 182 

Query: 166 PKGFYAEGSRGGSQASSRSSSRSRGNSHNSTPGSSRGNSPARMASGGGETALAIjLIiLDRIi 225 
P+G+Y EGS G S +SRS+SR+ N S ' SR NS ' R ++ G +A D++ 
Sbjct: 183 PQGYYIEGS-GRSAPNSRSTSRA-PNRAPSAGSRSRANSGNRTSTPGVTPDMA DQI 236 

Query: 226 NQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQ 281 

L GK + Q VTK++A E + KPRQKR+ KQ V Q P6+RGP Q 
Sbjct: 237 ASLVLAKLGK-DATKPQQVTKQTAKEVRQKILNKPRQKRSPNKQCTVQQCPGKRGPNQ— 293 

Query: 282 GNFGDQDIiIRQGT 294 
NPG ++++ GT 

Sbjct: 294 -NPGGGEMLKLGT 305 



These results indicate that SEQ ID NO: 10533 has functional similarities to a coronavirus 
nucleocapsid protein. 

30 In one embodiment, the invention comprises an amino acid sequence from the 5'3' Frame 1 

of Rgure 127 e.g. SEQ ID NO^: 10506-10514. Some encoded open reading frames wi thin this 
region are SEQ ID NO^: 10575 to 10578. 

Accordingly, the invention includes a polypeptide comprising the amino acid sequence 
selected from the group consisting of SEQ ID NO: 10575, SEQ ID NO: 10576, SEQ ID NO: 

35 10577 and SEQ ID NO: 10578. The invention includes a polypeptide comprising an amino acid 

sequence having sequence identity to a sequence selected from the group consisting of SEQ ID 
NO: 10097, SEQ ID NO: 10576, SEQ ID NO: 10577 and SEQ ID NO: 10578. The invention 
includes a fragment of a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 10097, SEQ ID NO: 10576, SEQ ID NO: 10577 and SEQ ID NO: . 
40 10578. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
sequence from the 3'5* Frame 2 of Rgure 127 e.g. SEQ ID NO®: 10547-10559. An open reading 
frame within this region is SEQ ID NO: 10579. 

The invention includes a polypeptide comprising an amin o add sequence of SEQ ID NO: 
45 10579. The invention includes a polypeptide comprising an amino add sequence having 

sequence identity to SEQ ID NO: 10579. The invention includes a fra gm ent of a polypeptide 
comprising an amino add sequence of SEQ ID NO: 10579. 
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Hie invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
5 Table 33. The invention further includes polynucleotide sequence comprising the complement 
of one or more of the primer sequences identified in Table 33. 

Hie invention includes a poljmucleotide sequence comprising SEQ ID NO: 11323. A 
polypeptide encoded by SEQ ID NO: 11323 is SEQ ID NO: 11324. 

The invention includes a polypeptide comprising SEQ ID NO: 1 1324, sequence having 
10 sequence identity to SEQ ID NO: 11324 and fragments of SEQ ID NO: 11324. The invention 
includes a firagment of SEQ ID NO: 1 1324, wherein said polypeptide fragment begins with a 
Methionine. 

Accordingly, the invention includes a polynucleotide sequence com prising SEQ ID NO: 
11323. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 

15 11323. The invention also provides for polynucleotide sequences comprising fragments of SEQ • 

ID NO: 1 1323. hi one embodiment, the polynucleotide firagment does not consist entirely of a 
known SARS polynucleotide sequence or a known coronavirus polynucleotide sequence. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence 
SEQ ID NO: 11323, including the ainino acid sequence of SEQ ID NO: 11324. 

20 Hie invention also provides amino acid sequences having sequence identity to an amino 

acid sequence encoded by SEQ ID NO: 11323. The invention provides amino acid sequences 
having sequence identity to SEQ ID NO: 1 1324. 

The invention provides firagments of amino acid sequences encoded by SEQ ID NO: 

11323. The invention also provides firagments of amino acid sequences of SEQ ID NO: 11324. 
25 In one embodiment, the fragment does not consist entirely of a known SARS amino acid 

sequence or a known coronavirus amino acid sequence. 

The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify die presence or absence of a SARS virus in a biological sample. The invention 
30 includes a polynucleotide sequence comprising one or more of the primer sequences identified as 

SEQ ID NO^: 11325-11440 (left part) and SEQ ID NO^: 11441-11551 (right part). The invention 
further includes polynucleotide sequence comprising the complement of one or more of the 
primer sequences identified as SEQ ID NO^: 11325-11551. 

The invention includes a polypeptide comprising SEQ ID NO: 11552. The SARS virus 
35 contains polymorphism at the Isoleucine residue Ile-324. The invention includes a polypeptide 
comprising an amino acid sequence having sequence identity to SEQ ID NO: 11552, wherein 
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said polypeptide includes an amino acid sequence selected from the group consisting of YSYAI 
(SEQ ID NO: 11553), SYAIH (SEQ ID NO: 11554), YAIHH (SEQ ID NO: 11555), ihhdk 
(SEQ ID NO: 11556), SYAI (SEQ ID NO: 11557), YAIH (SEQ ID NO: 11558), AIHH (SEQ ID 
NO: 11559). IHHD (SEQ ID NO: 11560), YAI, AIH, and IHH. The invention includes a 
5 fragment of a polypeptide comprising SEQ ID NO: 11552, wherein said fragment includes an 
amino acid sequence selected from the group consisting of YSYAI (SEQ ID NO: 11553) 

SYAIH (SEQ ID NO: 1 1554), YAIHH (SEQ ID NO: 1 1555), IHHDK (SEQ ID NO: 1 1556), 
SYAI (SEQ ID NO: 11557), YAIH (SEQ ID NO: 11558), AIHH (SEQ ID NO: 11559), ihhd 
(SEQ ID NO: 11560), YAI, AIH, and IHH. 

10 The invention includes a polypeptide comprising an amino acid sequence selected fix>m the 

group consisting of SEQ ID NO: 11561 and SEQ ID NO: 11562. Tfre invention includes a 
fragment of a polypeptide comprising an amino acid sequence selected from the group consisting 

of SEQ ID NO: 11561 and SEQ ID NO: 11562. 

The invention includes a diagnostic kit comprising a polypeptide comprising at least one of 
15 the amino acid ^uences selected from the group consisting of SEQ ID NO®: 11561 and 1 1562. 
Hie invention includes a diagnostic kit comprising a polynucleotide sequence encoding a 
polypeptide comprising at least one of the amino acid sequences selected from the group 
consisting of SEQ ID NO®: 11561 and 11562. The invention includes an immunogenic 
composition comprising a polypeptide comprising at least one of the amino add sequences 

« selectedfromthegroupconsistingofSEQIDNO®: 11561 and 11562. The invention includes an 

antibody which recognizes a polypeptide comprising at least one of the amino acid sequences 
selected from the group consisting of SEQ ED NO®: 1 1561 and 11562. 

The invention includes a polynucleotide sequence SEQ ID NO: 1 1563 or a fragment 
thereof or a sequence having sequence identity thereto. Polypeptide sequences which can be 
5 translated from SEQ ID NO: 11563 are shown in Rgure 128. The constituent amino acid 

sequences from Figure 128, having at least 4 amino acids, are listed as SEQ ID NO®- 11564 to 
11617. 

The invention includes a polypeptide sequence selected from the group consisting of the 
sequences of Hgure 128, or a fragment thereof or a sequence having sequence identity thereto 
3 e.g. SEQ ID NO®: 11563 to 1 1617. 

A polypeptide sequence within SEQ ID NO: 1 1600 is SEQ ID NO: 1 1618. The invention 
includes a polypeptide comprising SEQ ED NO: 1 1618, or a fragment thereof or a sequence 
having sequence identity thereto. 
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A polypeptide sequence within SEQ ID NO: 11602 is SEQ ID NO: 11641. Ite invention 
includes a polypeptide comprising SEQ ID NO: 11641, or a fragment thereof or a sequence 
having sequence identity thereto. 

A polypeptide sequence within SEQ ID NO: 11609 is SEQ ID NO: 11619. 

S Ihe invention includes a polynucleotide encoding (i) an amin o acid sequence selected from 

the group consisting of: (1) the amino acid sequ»ices of Hgure 128, and in particular SEQ ID 
NO^: 11564-11617; (2) SEQ ID NO: 11618; and (3) SEQ ID NO: 11619, or (ii) a fragment 
thereof. The invention includes a diagnostic kit comprising a one or more of these proteins. The 
invention includes a diagnostic kit comprising a polynucleotide sequence encoding one or more 
10 of these polypeptide sequences. The invention includes an antibody which recognizes one or 

more of the polypeptide sequences. 

The SARS virus may contain polymorphism at isoleucine residue Ile-326 in SEQ ID NO: 
11620 (Chi-PEP3). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 1 1620, wherein said polypeptide includes an amino 
15 acid sequence select from the group consisting of YAIHH (SEQ ID NO: 1 1621) and YAiniH 

(SEQ n> NO: 11622). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 11620, wherein said fragmrat includes an amino acid sequence selected from the group 
consisting of YAYHH (SEQ ID NO: 1 1621) and YATHH (SEQ ID NO: 1 1622). 

The SARS virus may contain polymorphism at glutamine residue Gln-830 in SEQ ID NO: 
20 1 1620. The invention includes a polypeptide comprising an amino acid sequence having 

sequence identity to SEQ ID NO: 1 1620, wherein said polypeptide includes an amino acid 
sequence selected from the group consisting of ASgAW (SEQ ED NO: 11623) and ASRAW (SEQ 
ID NO: 1 1624). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
11620, wherein said fragment includes an amin o acid sequence selected from the group 
25 consisting of ASgAW (SEQ ID NO: 11623) and ASJ^W (SEQ ED NO: 1 1624). 

The SARS virus may contain polymorphism at aspartic acid residue Asp-935 in SEQ ID 
NO: 11620. The invention includes a polypeptide comprising an amin o acid sequence having 
sequence identity to SEQ ED NO: 11620, wherein said polypeptide includes an amin o acid 
sequence selected from the group consisting of DADST (SEQ ED NO: 1 1625) and DAYST (SEQ 
30 ED NO: 1 1626). The invention includes a fragment of a polypeptide comprising SEQ ED NO: 
11620, wherein said fragment includes an amino acid sequence selected from the group 
consisting of DADST (SEQ ED NO: 11625) and DAYST (SEQ ED NO: 11626). 

The SARS virus may contain polymorphism at serine residue Ser-577 in SEQ ED NO: 
11627 (Chi-PEP4). Hie invention includes a polypeptide comprising an amino acid sequence 
35 having sequence identity to SEQ ED NO: 1 1627, wherein said polypeptide includes an amino 
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add sequence selected fiom the group consisting of PCSFG (SEQ ID NO: 11628) and PCAFG 
(SEQ ID NO: 1 1629). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 11627, wherein said fragment includes an amino add sequence selected from the group 
consisting of PCSFG (SEQ ID NO: 11628) and PC^G (SEQ ID NO: 11629). 

5 The SARS virus may contain polymorphism at valine residue Val-68 in SEQ ID NO: 

11630 (Chi-PEP8). The invention includes a polypeptide comprising an amino add sequraice 
having sequence identity to SEQ ID NO: 11630, wherein said polypeptide includes an amino 
add sequence selected from the group consisting of LAT^ (SEQ ID NO: 11631) and LAAVY 
(SEQ ID NO: 11632). The invention includes a fragment of a polypeptide comprising SEQ ID 
10 NO: 1 1630, wherein said fragment includes an amino add sequence selected fiom the group 
consisting of LAVVY (SEQ ID NO: 11631) and LAAVY (SEQ ID NO: 11632). 

The SARS virus may contain polymorphism at isoleudne residue Be-50 in SEQ ID NO: 
11633 (Chi-PEP13). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 11633, wherein said polypeptide includes an amino 
15 add sequence selected fiom the group consisting of NNXAS (SEQ ID NO: 11634) and NNTAS 

(SEQ ID NO: 11635). The invontion includes a fragment of a polypeptide comprising SEQ ID 
NO: 11633, wherein said fragment includes an amino add sequence selected fiom the group 
consisting of NN]yu3 (SEQ ID NO: 1 1634) and NNTAS (SEQ ID NO: 11635). 

The SARS virus may contain a polymorphism at Serine residue Ser-943 in SEQ ID NO: 

20 1 1636. The invdition includes a polypeptide comprising an amino acid sequence having 

sequence identity to SEQ ID NO: 11636, wherein said polypeptide includes an amino add 
sequence selectwi fiom the group consisting of AVSAC (SEQ ID NO: 11637) and AV^G (SEQ 
ID NO: 1 1638). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
11636, wherein said fragment includes an amino add seuence selected fiom the group consisting 
25 of AVSAC (SEQ ID NO: 11637) and AVGAC (SEQ ID NO: 11638). 

The invention includes a polynucleotide SEQ ID NO: 11639, or a fiagment thereof or a 
sequence having sequence identity thereto. The invention includes a polypeptide encoded by the 
polynucleotide sequence set forth in SEQ ID NO: 11639, or a fragment thereof or a polypeptide 
sequence having sequence identity thereto. 

50 The invention includes a polynucleotide set forth in SEQ ID NO: 1 1640, or a fragment 

thereof or a sequence having sequence identity thereto. The invention includes a polypqrtide 
encoded by the polynucleotide sequence set fordi in SEQ ID NO: 1 1640, or a firagment thereof or 
a polypeptide sequence having sequence identity thereto. 

The invention includes each of the polynucleotides identified above. The invention 
»5 includes each of the polynucleotides set forth in the sequence listing. The inv«ition further 
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includes polynucleotides having sequence identity to each of the polynucleotides identified 
above. Ihe degree of sequence identity is pieforably greater dian 50% (e.g., 60%, 61%, 62%, 
63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 
79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 

5 95%, 96%, 97%, 98%, 99% or more). 

The invention includes polynucleotide sequences comprising fragments of each of the 
polynucleotide sequences identified above. The fragments should comprise at least n 
consecutive polynucleotides from a particular SEQ ID NO:, and, depending on the sequence, n is 
7 or more (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 

10 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120, 130, 140, 

150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

. [^e invention includes each of the amin o acid sequences encoded by each of the 
polynucleotide sequCTces identified above. The invention includes each of the amino acid 
sequenc^ encoded by each of the polynucleoti^ sequences set forth in the sequence listing. 

15 The invention further includes amino acid sequences having sequence identity to the amino acid 
sequences encoded by each of the polynucleotide sequences identified above. The degree of 
sequence identity is preferably greater than 50% (e.g . , 60%, 61%, 62%, 63%, 64%, 65%, 66%, 
67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 
83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 

20 99% or more). The invention further includes fragments of amin o acid sequences encoded by 

each of the polynucleotide sequences identified above. The fragments should comprise at least n 
consecutive amino acids from a particular SEQ ID NO:, and, depending on the sequence, n is 7 
or more (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120, 130, 140, 150, 
25 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

The invention includes each of the amino acid sequences identified above. The invention 
includes each of the amino add sequence set forth in the sequence listing. The invention further 
includes amino acid sequences having sequence identity to each of the aminn acid sequences 
identified above. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 

30 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 

77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99% or more). 

The invention further includes fragments of the amino add sequences identified above. 

The fragments should comprise at least n consecutive amino acids from a particular SEQ ID 
35 NO:, and, depending on the sequence, n is 7 or more (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 

18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 
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60, 65, 70, 80, 90, 100, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 

260. 270. 280. 290. 300 or moie). 

The invention includes polynucleotides encoding each of the amino acid sequences 
identified above. The invention includes polynucleotides encoding each of the amino acid 
5 sequences set forth in the sequence listing. The invention further includes polynucleotides 
having sequence identity with each of the polynucleotides encoding each of the amino add 
sequences identified above. Hie degree of sequence identity is prefranbly greater than 50% (c.g., 
60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%. 70%, 71%, 72%, 73%, 74%, 75%, 
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 

10 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more). 

The invention further includes fiagments of polynucleotides encoding each of the amino 
acid sequences identified above. The fragments should comprise at least n consecutive 
polynucleotides from a particular SEQ ID NOr, and, deprading on the sequence, /t is 7 or more 
(e.g, 7, 8, 9, 10, 11, 12, 13. 14. 15, lfU7, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28. 29. 30, 31, 

15 32. 33. 34, 35, 36, 37, 38, 39. 40, 45, 50, 55, 60, 65, 70, 80. 90. 100. 120, 130, 140. 150, 160, 

170. 180. 190. 200. 210. 220. 230. 240. 250. 260. 270. 280. 290. 300 or more). 

As described in more detail below, polynucleotides for use as primers and/or as probes 
may contain at least 4 or 8 contiguous nucleotides fix>m a polynucleotide sequence of the 
invention e.g. at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24. 25, 26, 27, 28, 29, or 30 
20 contiguous nucleotides and up to about 50, 75, 100, 200 contiguous nucleotides or more. While 
6-8 nucleotides may be a workable length, sequences of 10-12 nucleotides are preferred, and 
about 13, 14, 15, 16, 17, 18, 19, 20, or 21 or more nucleotides or more appears op timal for 
hybridisation. 



In one embodiment, the invention is directed to polynucleotides and amino acid sequences 
25 that do not consist entirely of a known SARS virus polynucleotide or amino acid sequenbe or of 
a known coronavirus polynucleotide or amino acid sequence. In one embodiment, the 
polynucleotides and amino acid sequences of the invention do not consist entirely of the 
sequence SEQ ID NO: 1. In another embodiment, the polynucleotides and amino acid sequences 
of the invention do not consist entirely of the sequence SEQ ID NO: 2. SEQ ID NO: 9967 is a 
W SARS genome sequence of the Frankfurt (FRA) isolate (GenBank: AY3 10120). Compared to 
SEQ ID NO: 1, it differs at nucleotides 2546, 2590, 11437, 18954, 19073, 20585, 20899, 23209, 
24922, 26589 & 28257; compared to SEQ ID NO:2, it differs at nucleotides 2560, 7922, 11451, 
16625, 18968 & 19067. Furdier genome sequences have become available from GenBank, since 
this application was originally filed, under accession numbers including AY559097, AY559096, 
i5 AY559095, AY559094. AY559093, AY559092, AY559091. AY559090, AY559089. AY559088, 
AY559087. AY559086, AY559085, AY559084, AY559083, AY559082, AY559081, AY2741 19, 
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AY323977, AY291315, AY502932, AY502931, AY502930, AY502929, AY502928, AY502927, 
AY502926, AY502925, AY502924, AY502923, AY291451, AY390556, AY395003, AY395002, 
AY395001, AY395000, AY394999, AY394998, AY394997, AY394996, AY394995, AY394994, 
AY394993, AY394992, AY394991, AY394990, AY394989, AY394987, AY394986, AY394985, 

5 AY394983, AY394979, AY394978, AY508724, AY394850, AY463059, AY463060. AY313906, 

AY310120, AY461660, AY485278, AY485277, AY345988, AY345987, AY345986, AY282752, 
AY357076, AY357075, AY350750, AY304495, AY304488, AY304486, AY427439, AY283798, 
AY278491, AY278489, AY362699, AY362698, AY283797, AY283796, AY283795, AY283794, 
AY278741, AY351680, AP006561, AP006560, AP006559, AP006558, AP006557, AY278554, 

10 AY348314, AY338175, AY338174, AY321118, AY279354, AY278490, AY278487, AY297028, 

AY278488, and NC_004718. 

In another embodiment, the invention is directed to polynucleotides that encode proteins 
which are not immunologically cross reactive with a protein of a mouse hepatitis virus, a bovine 
coronavirus or an avian infectious bronchitis virus. In another embodiment, the invention is 
1-5 directed to protmns which are not immunologically cross reactive with a protein of a mouse 

hepatitis virus, a bovine coronavirus or an avian infectious bronchitis virus. 

Each of the polynucleotides identified above may be used to encode a portion of a fusion 
protein. Accordingly, the invention complies one or more of the polynucleotides identified 
above wherein the polynucleotides encoding for the start codon are removed. The invention 
20 further comprises one or more of the amino acids identified above wherein the starting 
methionine is removed. 

Any of the polynucleotide or amino acid sequences discussed above may be used in 
vaccines for die treatment or prevention of SARS virus infection, including as a SARS viral 
antigen. Additionally, any of die poljmucleotides or amin o acid sequences discussed above may 
25 be used as diagnostic reagents, or in kits (comprising such reagents) or in methods used to 
diagnose or identify the presence or absence of a SARS virus in a biological sample. 

SARS viral antigens of the invention may include a polypeptide with 99%, 95%, 90%, 
85%, or 80% homology to one or more of the group consisting of the following proteins: 
nonstmctural protein 2 (NS2); hemaggjutinin-esterase glycoprotein (HE) (also referred to as E3), 
30 spike glycoprotmn (S) (also referred to as E2), nonstmctural region 4 (NS4), envelope ( small 
membrane) piotdn (E) (also referred to as sM), membrane glycoprotein (M) (also referred to as 
El), nucleoc^sid phosphoprotein (N) or RNA dependent RNA polymerase ^1). 

A detailed discussion of Coroavirus biology can be found in Fields Virology (2nd ed), 
Reids et al. (eds.), B Raven Press, New York, NY., Chapter 35. 

35 Another example of a SARS virus isolate is set forth in Example 1 below. The invention 

includes each of the polypeptide and polynucleotide sequences identified in Example 1. fii 
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addition, the invention includes vaccine formulations comprising one or more of the polyp^tide 
or polynucleotide sequences identified in Example 1. The invention includes diagnostic 
regaents, kits (con^sing such reagents) and methods which can be used to diagnose or identify 
the presence or absence of a SARS virus in a biological sample using one or more of the 
5 polypeptide or polynucleotide sequences idaitified in Example 1. Hie invention includes 
methods for the treatment or prevention of SARS virus infection utilizing small molecule viral 
inhibitors and combinations of small molecule viral inhibitors and kits for the treatment of 
SARS. The small molecule inhibitors may specifically target one or more of the polypeptides or 
polynucleotides identified in Example 1 . 

10 Further discussion of terms used in the application follows below. 

“Respiratory Virus” as used herein refers to a virus capable of infecting the human 
respiratory tract Respiratory Viral Antigens suitable for use in the invention include Severe 
Acute Respiratory Syndrome virus, coronavirus, influenza virus, human riiinovirus (HRV), 
parainfluenza virus CPIV), respiratory syncytial virus (RSV), adenovirus, met^neumovirus, and 
15 riiinovirus. 

Hie terms polypeptide”, “protein” and “ammo acid sequence” as used herein generally 
refer to a polymer of amino acid residues and are not limited to a minimum length of the product. 
Thus, peptides, oligopeptides, ;dimers, mulimers, and the like, are included within the definition. 
Both full-length proteins and fingments thereof are encompassed by the definition. Minimum 
20 fragments of polypeptides useful in the invention can be at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 

14, or even 15 amino acids. Typically, polypeptides useful in this invention can have a ' 
maximum length suitable for the intended application. Generally, the mayimum lengthiis not 
critical and can easily be selected by one skilled in the art 

Polypeptides of the invention can be prepared in many ways e.g. by chemical synthesis (at 
25 least in part), by digesting longer polypeptides using proteases, by translation from RNA, by 
purification fix>m cell culture {e.g. from recombinant expression), from the organism itself (e.g. 
after viral culture, or direct from patients), from a cell line source etc. A preferred method for 
production of peptides <40 amino acids long involves in vitro chemical synthesis (Bodanszky 
(1993) Principles of Peptide Synthesis (ISBN: 0387564314); Fields et oL (1997) Methods in 
30 Enzymology 289: Solid-Phase Peptide Synthesis. ISBN: 0121821900). Solid-phase peptide 
synthesis is particularly preferred, such as methods based on t-Boc or Fmoc (Chan & White 
(2000) Fmoc Solid Phase Peptide Synthesis ISBN: 0199637245) chemistry. Brzymatic synthesis 
(KuUmann (1987) Enzymatic Peptide Synthesis. ISBN: 0849368413) may also be used in part or 
in full. As an alternative to chemical synthesis, biological synthesis ma y be used e.g. the 
35 polypeptides may be produced by translation. This may be carried out in vitro or in vivo. 

Biological methods are in genraral restricted to the production of polypeptides based on L^amino 
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adds, but manipulation of translation machinery (e.g. of aminoacyl tRNA molecules) can be 
used to allow the introduction of D-amino adds (or of other non natural amino acids, such as 
iodotyrosine or methylphenylalanine, azddohomoalanine, etc.) (Ihba (1996) Biotechnol Genet 
Eng Rev 13:197-216.). Whae D-amino adds are included, however, it is preferred to use 
5 chemical synthesis. Polypeptides of the invention may have covalent modifications at die 
C-teiminus and/or N-terminus, particularly where diey are for in vivo administration e.g by 
attachment of acetyl or carboxamide, as in the Fuzeon™ product 

Reference to polypeptides and the like also includes derivatives of the amino acid 
sequences of the invention. Such derivatives can include postexpression modifications of the 
10 polypeptide, for exan^le, glycosylation, acetylation, phosphorylation, and the like. Amino add 
derivatives can also include modifications to the native sequence, such as deletions, additions 
and substitutions (^nranlly conservative in nature), so long as the protein maintains the desired 
activity. Diese modifications may be deliberate, as through site-directed mutagenesis, or may be 
acddental, such as through mutations of hosts which produce the proteins or errors due to PCR 
15 amplification. Furthermore, modifications may be made that have one or more of the following 
effects: reducing toxicity; facilitating ceD processing {e.g., secretion; antigen presentation, etc.)-, 
and facilitating presentation to B-cells and/or T-cells. 

‘Fragmenf ’ or “Portion” as used herein refers to a polypeptide consisting of only a part of 
the intact full-length polypeptide sequence and structure as found in nature. For instance, a 

10 fiagmentcanincludeaC-terminaldeletionand/or an N-tenninal deletion of a protein. 

A “recombinant” protein is a protein which has been prepared by recombinant DNA 
■ techniques as described herein. In general, the gene of interest is cloned and then expressed in 
transformed organisms, as described further below. The host organism expressed the foreign 
gene to produce the protein under expression conditions. 

:5 The term "polynucleotide", as known in the art, generally refers to a nucleic acid molecule. 

A polynucleotide” can include both double- and single-stranded sequences and refers to, but is 
not liimted to, cDNA from viral, prokaryotic or eukaryotic mRNA, genomic RNA and DNA 
sequences from viral {e.g. RNA and DNA viruses and retroviruses) or prokaryotic DNA, and 
especially synthetic DNA sequences. The term also captures sequences that include any of the 
0 known base analogs of DNA and RNA, and includes modifications such as deletions, additions 
and substitutions (generaUy conservative in nature), to the native sequence, so long as the nucleic 
acid molecule »icodes a therapeutic or antigenic protein. These modifications may be 
deliberate, as through site-directed mutagenesis, or may be accidental, such as through mutations 
of hosts that produce the antigens. Modifications of polynucleotides may have any number of 
5 effects including, for example, facilitating expression of the polypeptide product in a host cell. 
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Polynucleotides of the invention may be piepaied in many ways e.g. by chemical synthesis 
(e.g. phosphoiamidite synthesis of DNA) in whole or in part, by digesting longer nucleic adds 
using nucleases (e.g. restriction enzymes), by joining shorter nucleic adds or nucleotides (e.g. 
using ligases or polymerases), from genomic or cDNA libraries, etc. 

5 A polynucleotide can encode a biologically active (e.g. , immunogenic or therapeutic) 

protein or polypeptide. Depending on the nature of die polypeptide encoded by the 
polynucleotide, a polynucleotide can include as little as 10 nucleotides, e.g., where the 
polynucleotide encodes an antigen. 

By “isolated” is meant, when referring to a polynucleotide or a polypeptide, that the 
10 indicated molecule is separate and discrete from the whole organism witii which the molecule is 
found in nature or, when the polynucleotide or polypeptide is not found in nature, is sufficiently 
free of other biological macromolecules so that the polynucleotide or polypeptide can be used for 
its intended purpose. The polynucleotides and polypeptides of the invention are preferably 
isolated polynucleotides and isolated polypeptides. 

15 Antibody as known in the art includes one or more biological moieties that, through 

chemical or physical means, can bind to or associate with an epitope of a polypeptide of interest. 
The antibodies of the invention include antibodies which specifically bind to a SARS viral 
antigen. The tram “antibody” includes antibodies obtained fiom both polyclonal and monoclonal 
preparations, as well as the following: hybrid (chimeric) antibody molecules (see, for example, 
iO Winter et oL (1991) Nature 349: 293-299; and US Patent No. 4,816,567; F(ab ’)2 and F(ab) 
firagments; Fy molecules (non-covalent heterodimers, see, for example. Inbar et al (1972) Proc 
Natl Acad Sci USA ^:2659-2662; and Hirlich et al. (1980) Biochem 19:4091-4096); single- 
chain Fv molecules (sFv) (see, for example, Huston et al. (1988) Proc Natl Acad Sci USA 
M:5897-5883); dimeric and trimeric antibody fragment constracts; minibodies (see, e.g. , Pack 
15 et al (1992) Biochem 31 : 1579-1584; Cumber et al (1992) J Immunology 149B : 120-126); 

humanized antibody molecules (see, for example, Riechmann et al. (1988) Nature 332:323-327: 
Verhoeyan et al (1988) Science 239:1534-1536; and U.K. Patent Publication No. GB 2,276,169, 
published 21 September 1994); and, any functional fragments obtained from such molecules, 
wherein such fragments retain immunological binding properties of the parent antibody 
>0 molecule. The term antibody’ fiirthra includes antibodies obtained through non-conventional 
processes, such as phage display. 

As used herein, the term monoclonal antibody” refers to an antibody composition having 
a homogeneous antibody population. The term is not limited regarding the species or source of 
the antibody, nor is it intended to be limited by die manner in which it is made. Thus, the term 
5 encompasses antibodies obtained from murine hybridomas, as well as human monoclonal 
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antibodies obtained using human lather than munne hybndomas. See, e.g., Cote, et aL 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 1985, p 77. 

An immunogemc composition as used herein lefeis to a composition that comprises an 
antigenic molecule where administration of the composition to a subject results in the 
5 development in the subject of a humoral and/or a cellular immune response to the antigenic 
molecule of interest Hie immunogenic composition can be introduced directly into a recipient 
subject such as by injection, inhalatiom oral, intranasal or any other parenteral, mucosal or 
transdeiinal (e.g., intra-rectally or intra-vaginally) route of administration. 

Tte tom "derived from" is used to identify the source of molecule {e.g., a molecule can be 
10 derived from a polynucleotide, polypeptide, an immortalized cell line can be derived tom any 
tissue, etc.). A first polynucleotide is "derived tom" a second polynucleotide if it has the same 
or substantially the same basepair sequence as a region of the second polynucleotide, its cDNA, 
complements thereof, or if it displays sequence identity as described above. Thus, a first 
polynucleotide sequence is "derived tom" a second sequence if it has (i) the same or 
15 substantially the same sequence as the second sequence or (ii) displays sequence identity- to 

polypeptides of that sequence. 

A first polypeptide is "derived tom" a second polypeptide if it is (i) encoded by a first 
polynucleotide derived tom a second polynucleotide, or (ii) displays sequence identity to the 
second polypeptides as described above. Thus, a polypeptide (protein) is "derived tom" a 
10 particular S ARS virus if it is (i) encoded by an open reading frame of a polynucleotide of that 

SARS virus, or (ii) displays sequence identity, as described above, to polypeptides of tiiat SARS 
virus. 

Both polynucleotide and polypeptide molecules can be physically derived tom a SARS 
virus or produced recombinantly or synthetically, for example, based on known sequences. 

15 A cultured cell or ceU line is "derived tom" another cell, cells or tissue if it is originally 

obtained tom existing cells or tissue. Non-limiting exanq>les of tissue that cells ma y be derived 
tom include skin, retina, livor, kidney, heart, brain, muscle, intestinal, ovary, breast, prostate, 
cancerous tissue, tissue infected with one or more pathogens (e.g., viruses, bacteria etc.) and the 
Jlk®- The cells described herein may also be denved from other ceUs including, but not limited 
0 to, primary cultures, existing immortalized cells line and/or other isolated cells. 

An "antigen" refers to a molecule containing one or more epitopes (either linear, 
conformational or both) that will stimulate a host's immime system to make a humoral and/or 
cellular antigen-specific response. The term is used interchangeably with the term 
"immunogen." Normally, an epitope will include betwe«i about 3-15, generally about 5-15 
5 amino acids. A B-cell qpitope is normally about 5 amino acids but can be as small as 3-4 amino 
acids. A T-cell epitope, such as a CTL epitope, will include at least about 7-9 amino acids, and a 
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helper T-cell qntope at least about 12-20 amino acids. Normally, an ^itope will include 
between about 7 and 15 amino acids, such as, 9, 10, 12 or 15 amino acids. Ibe term “antigen” 
denotes both subunit antigens, {ue., antigens which are separate and discrete fiom a whole 
organism with which the antigen is associated in nature), as weU as, killed, attenuated or 
5 inactivated bacteria, viruses, fungi, parasites or other microbes as well as tumor antigens, 
including extraceUular domains of cell surface receptors and intracellular portions that may 
contain T-cell epitopes. Antibodies such as anti-idiotype antibodies, or fragments thereof, and 
synthetic peptide mimotopes, which can mimic an antigen or antigenic determinant, are also ' 
cultured under the definition of antigen as used herein. Similarly, an oligonucleotide or 
10 polynucleotide that expresses an antigen or antigenic deteminant in vivo, such as in gene therapy • 
and DNA inamnnization applications, is also included in the definition of antigen herein. 

An immunological response" to an antigen or composition is the development in a 
subject of a humoral and/or a cellulair immune response to an antigen present in the composition < 
of interest For purposes of the present invention, a "humoral immime response" refers to an 
5 unmune response mediated by antibody molecules, including secretory (IgA)-or IgG molecules, 

while a "cellule immune response" is one mediated by T-lymphocytes and/or other white blood 

cells. One important aspect of cellular inmiunity involves an antigen-specific response by 

cytolytic T-cells ("CTL"s). CTLs have specificity for peptide antigens that are presented in 
association with proteins encoded by the major histocompatibility complex (MHC) and 
0 expressed on the surfaces of ceUs. CTLs help induce and promote the destruction of intraceUular 

imcrobes, or the lysis of cells infected with such microbes. Another aspect of cellular immunity 

involves an antigen-specific response by helper T-cells. Helper T-cells act to help stimulate the 
function, and focus the activity of, nonspecific effector cells against cells displaying peptide 
antigens in association with MHC molecules on their surface. A “cellular immune response” 

5 also refers to the production of cytokines, chemokines and other such molecules produced by 
activated T-cells and/or other white blood cells, including those derived from CD4+ and CD8+ 
T-ceUs. In addition, a chemokine response may be induced by various white blood or 
endothelial cells in response to an administered antigen. 

U. VACCINE FORMULATIONS 

) The invention relates to vaccine formulations for the treatment or prevention of Severe 

Acute Respiratory Syndrome (SARS). Vaccine formulations of the invention include an 
inactivated (or killed) SARS virus, an attenuated SARS virus, a split SARS virus preparation and 
a recombinant or purified subunit formulation of one or more SARS viral antig^. The 
mvention includes polypeptides and polynucleotides encoding for SARS viral antigens and 
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immunogoiic fragments thereof. Expression and delivery of the polynucleotides of the invention 
may be facilitated via viral vectors and/or viral particles, including Virus T.iVft Particles (VLPs). 
A Inactivated ( or Killed) SARS Vaccines 

TTie invention includes a composition comprising an inactivated (or killed) SARS virus and 
5 methods for die production thereof. Liactivated SARS viral compositions can be used as 

imiphylactic or dierapeutic SARS virus vaccine. Preferably the inactivated SARS virus vaccine 
composition comprises an amount of inactivated SARS virus which, before inactivation, is 
equivalent to a virus titer of from about 4 to 7 logs plaque forming units (PFU) or 4 to 7 logs 
tissue culture infectious dose 50 (TCID50) per milliliter. More preferably, before inactivation the 
10 virus titer is from 4 to 11, 7 to 11 or 9 to 11 PFU or TCID50. Still more preferably die 

inactivated SARS virus vaccine composition comprises an amount of inactivated SARS virus 
which, before inactivation, is equivalent to a vims titer of from about 5 to 9 PFU or 5 to 9 
TODso per milliliter. In one embodiment, the PFU or TCID50 of the cultured SARS virus at 
harvest is 6 to 8, more preferably about 7.5 PFU or TCIDso per millili ter. Upon concentration of 
15 ’ the viral harvest, the PFU or TCID50 is preferably 8 to 1 1, still more preferably about 9 PFU or 

TCID50 per milliliter. The vaccine composition comprises a sufficient amount of the SARS vims 
antigen to produce an immunological response in a primate 

Methods of inactivating or killing viruses are known in the art to destroy the ability of the 
viruses to infect m a mm a li an cells. Such methods include both chemical or physical means. 

10 Chemical means for inactivating a SARS vimis include treatment of the vims with an effective 
amount of one or more of the following agents: detergents, formaldehyde, formalin, P- 
pfopiolactone, or UV light. Additional chemical means for inactivation include treatment with 
methylene blue, psoralen, caiboxyfuUermie (C60) or a combination of any thereof. Other 
methods of viral inactivation are known in the art, such as for example binary ethylamine, acetyl 
15 ethyleneimine, or gamma irradiation. 

For example formaldehyde may be used at concentrations such as 0.1 to 0.02%, preferably 
at 0.02 to 0.1 %, and still more preferably at 0.04 to 0.05%. The inactivating agent is to 
vfrus containing culture supernatants prior to or after harvesting said culture supernatants from 
vessels used for virus propagation, either widi or without a step of cell disruption for release of 
0 cell-associated virus prior to harvesting. Further, the inactivating agent may be added after said 
culture supernatants have been stored frozen and thawed, or after one or more steps of 
purification to remove cell contaminants. Prefraably, however, formaldehyde is added after 
removal of cells and cellular debris or after one or more purification steps. After addition of 
formaldehyde, the vims containing mixture is transferred into an incubation vessel and incubated 
5 at refrigeration temperatures (e.g. +2 to 8“C) or alternatively at elevated temperatures, such as 
ambient ten^)eratures between approximately 20 and 30®C or at 33®C to 37°C for a pmod of 12 
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hours to 7 days, whereby the temperature chosen should be adjusted to the duration of 
incubation. Prefered conditions are e.g. +2 -8“C for 3-7 days (prefered are 3 -7days), ambient 
temperatures and incubation for 16 hours to 3 days (prefered 24- 48 hours), or 35-37°C for 12-36 
hours. If it is desirable to remove excess formalin, sodium thiosulfate or sodium metabisulfite at 
5 equimolar or 1.5 -fold molar concentration (relative to formaldehyde) may be added after 
completing the inactivation process. 

For example, P-propiolactone may be used at concentrations such as 0.01 to 0.5%, 
preferably at 0.5% to 0.2%, and stiU more preferably at 0.025 to 0.1%. The inactivating agent is 
added to virus containing culture supernatants (virus material) prior to or after harvesting said 
10 culture supernatants from vessels used for virus propagation, either with or without a step of cell 

disruption for release of cell-associated virus prior to harvesting. Further, the inactivating agent 
may be added after said culture supOTiatants have been stored fiwzen and thawed, or after one or 
more steps of purification to remove ceU contaminants. P-propiolactone is added to the virus 
material, with the adverse shift in pH to acidity being controlled with sodium hydroxide (e.g., 1 
.5 N NaOH), a Tr|s-buffer or sodium bicarbonate solution. After transfering the mixture to anotiier 
inactivation vessel, the combined inactivating agent-vims materials are incubated at temperatures 
from 4'’C to 37°C, for incubation times of preferably 24 to 72 hours. 

Another inactivant which may be used is binary ethyleneimine (BEI). Equal volumes of a 
0.2 molar bromoethylamine hydrobromide solution and a 0.4 molar sodium hydroxide solution 
0 are mixed and incubated at about 37°C. for 60 minutes. Tlie resulting cyclized inactivant is 
binary ethyleneimine, which is added to the virus materials at 0.5 to 4 percent, and preferably at 
1 to 3 percent, volume to volume. The inactivating virus materials are held from about 4°C to 
37°C for 24 to 72 hours wiA periodic agitation. At the end of this incubation 20 ml. of a sterile 1 
molar sodium thiosulfate solution was added to insure neutralization of the BEL 
5 hi one embodiment, the invention includes an inactivating method is designed to maTimlyp 

exposure of the virus to the inactivating agent and to miniinize long-term exposure of the 
temperature sensitive SAKS virus particles to elevated temperatures. ITie invention includes an 
inactivation method comprising exposing the virus to the inactivation agent (such as BPL) for 12 
to 24 hours at refrigeration tempaatures followed by hydrolysis of any residual inactivating 
0 agent by elevating the temperature for only 3 hours. Preferably, the refiigeration temperatures 
are between 0 and 8“C, more prefraably around 4°C. Preferably, the elevated temperature is 
between 33 and 41°C, more preferably around 37“C. As assessed by a test for residual infectious 
virus using 10 ml aliquots of the inactivated preparation, the method is able to inactivate SARS- 
CoV in raw cell culture harvests below a theoretical limit of 0.03 infectious units/ml. 

5 Diluted and undiluted samples of the inactivated virus materials are gdded to susceptible 

cell (tissue) culture (e.g., VERO) to detect any non-inactivated virus. The cultured cells are 
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- passaged multiple times and examined for the presence of SARS virus based on any of a variety 
of methods, such as, for example, cytopathic effect (CPm and antigen detection (e.g., via 
fluorescent antibody conjugates specific for SARS virus). Such tests allow determination of 
complete virus inactivation. 

5 Prior to inactivation, the SARS virus will be cultured in a mammalian cell culture. The cell 

culture may be adherently growing ceUs or cells growing in suspension. Preferably the cells are 
of mammalian origin, but may also be derived from avian (e.g., hens’ cells such as hens’ embryo 
cells (CEF cells)), amphibian, reptile, insect, or fish sources. Mammalian sources of cells 
include, but are not limited to, human or non-human primate (e.g., MRC-5 (ATCC CCL-171), 

10 WI-38 (ATCC CCL-75), HeLa cells, human diploid cells, fetal Aesus lung cells (e.g. ATCC CL- 

160), human embryonic kidney cells (293 cells, typically transformed by sheared adenovirus 
type 5 DNA), VERO cells (e.g., from monkey kidneys), horse, cow (e.g., MDBK cefls), sheep, 
dog (e.g., MDCK cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, 
deposit number DSM ACC 2219 as described in WO 97/37001), cat, and rodent {e.g.. hamster 
15 cells such as BHK21-F, HKCC cells, or Chinese hamster ovary cells (CHO cells)), and may be 
obtained from a wide variety of developmental stages, including for example, adult, neonatal, 
fetal, and embryo. 

In certain embodiments the cells are immortalized {e.g., PERC.6 cells are described, for 
example, in WO 01/38362 and WO 02/40665, incorporated by reference herein in their 
iO entireties, as weU as deposited under ECACC deposit number 96022940), or any other cell type 
immortalized using the techniques described herein. 

In preferred embodiments, mammaUan cells are utilized, and may be selected fiom and/or 
derived from one or more of the following non-limiting cell types: fibroblast cells {e.g., dermal, 
lung), endothelial cells {e.g . , aortic, coronary, pulmonary, vascular, dermal microvascular, 
t5 umbilical), hepatocytes, keratinocytes, immune cells {e.g . , T cell, B cell, macrophage, NK^ 

dendritic), m a mmar y cells {e.g., epithelial), smooth muscle cells (e.g., vascular, aortic, coronary, 
arterial, uterine, bronchial, cervical, retinal pericytes), melanocytes, neural cells {e.g., astrocytes), 
prostate cells {e.g., epiAelial, smooth muscle), renal cells {e.g., epithelial, mesangial, proximal 
tubule), skeletal cells {e.g., chondrocyte, osteoclast, osteoblast), muscle cells {e.g., myoblast, 

0 skeletal, smooth, bronchial), liver cells, retinoblasts, and stromal cells. WO 97/37000 and WO 
97/37001, incorporated by reference herein in their entireties, describe production of animal cells 
and cell lines that capable of growth in suspension and in serum ^ media and are useful in the 
production and leplication of viruses. 

Prefraably, the SARS viruses of Ae invention are grown on VERO cells or fetal Aesus 
5 kidney cells. 
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Culture conditions for the above cell types are well-described in a variety of publications; 
or alternatively culture medium, supplements, and conditions may be purchased commercially, 
such as for example, as described in the catalog and additional Uteratuie of Cambrex Bioproducts 
(East Rutherford, NJ). 

5 In certain embodiments, the host cells used in the methods described herein are cultured in 

serum free and/or protdn free media. A medium is referred to as a serum-free mptHinm in the 
context of the present invention in which there are no additives from serum of human or animal 
origin. Protein-free is understood to mean cultures in which multiplication of the cells occurs 
with exclusion of proteins, grov»^ factors, other protein additives and non-serum proteins. The 
10 cells growing in such cultures naturally cont^ proteins themselves. 

Known serum-free media include Iscove's medium, Ultra-CHO medium (BioWhittaker) or 
EX-CELL (JRH Bioscience). Ordinary serum-containing media include Eagle's Basal Mfvttnm 
(BME) or Minimum Essaitial Medium (MEM) (Eagle, Science, 130, 432 (1959)) orDulbecco's 
Modified Eagle Medium (DMEM or EDM), which are ordinarily used with up to 10% fetal calf 
15 serum or similar additives. Optionally, Minimum Essential Medium (MEM) (Eagle, Science, 
130, 432 (1959)) or Dulbecco’s Modified Eagle Medium (DMEM or EDM) may be used without 
any serum containing supplement. Protein-free media like PF-CHO (JHR Bioscience), 
chemicafiy-defined media like ProCHO 4CDM (BioWhittaker) or SMIF 7 (Gibco/BRL Life 
Technologies) and mitogenic p^tictes like Primactone, Pepticase or HyPep™ (aU from Quest 
10 International) or lactalbumin hydrolyzate (Gibco and other manufacturers) are also adequately 
known in the prior art The media additives based on plant hydrolyzates have the special 
advantage that contamination with viruses, mycoplasma or unknown infectious agents rstn be 
ruled out 

. ' The cell culture conditions to be used for the desired application (temperature, cell density, 
15 pH value, etc.) ate variable over a very wide range owing to tiie suitability of the cell line 

employed according to the invention and can be adapted to the requirements of the SARS virus. 

Hie method for propagating the SARS virus in cultured cells (e.g., mammalian cells) 
includes the steps of inoculating the cultured cells with SARS virus, cultivating the infected ceUs 
for a desired time period for virus propagation, such as for example as determined by SARS 
>0 virus titer or SARS virus antigen expression (e.g., between 24 and 168 hours after inoculation) 
and collecting the propagated virus. The cultured cells are inoculated with a SARS virus 
(measured by PFU orTCIDso) to cell ratio of 1:10000 to 1:10. A lower range of ratios may also 
be used e.g. 1:500 to 1:1, preferably 1:100 to 1:5, more preferably 1:50 to 1:10. The SARS virus 
is added to a suspension of the cells or is qiplied to a monolayer of the cells, and the virus is 
i5 absorbed on the cells for at least 60 minutes but usually less than 300 minntf»g preferably 
between 90 and 240 minutes at 25°C to 40®C, more preferably 28°C to 37®C, still more 
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preferably at about 33 °C. TTie infected ceU culture (e.g., monolayers) may be treated either by 
fteeze-thawing or by enzymatic action to increase the viral content of the harvested culture 
supernatants. The harvested fluids are thrai either inactivated or stored fiozen. 

A comparison of SARS infected Vero cells grown with and without fetal calf serum 

5 ("FCS")is shov™mHOURE26A. Briefly, Vero cell, were split the day brfoie infection and 

cultivated in T175 flasks. Infection of 90% confluent Vero cell monolayeis the following day 
was perfbnned with a SARS^CoV seed stock (strain FRA, passage 4, Accession number 
AY3I0120),withorwithout3%FCS(Tig.26A). Hie addition of FCS to the cefl media showed 

little impact on virus yield. 

10 Cultured cells may be infected at a multiplicity of infection (“m.o.i.”) of about 0.0001 to 

10, preferably 0.002 to 5. more preferably to 0.001 to 2. Still more preferably, the cells are 

infected at a m.o.i of about 0.01. A comparison of viral yield at varying m.o.i. levels is shown in 
FIGURE 26B. 

Infected cells may be harvested 30 to 60 hours post infection. Preferably, the cells are 
5 harvested 34-48 hours post infection. Still more preferably, the cells are harvested 38 to 40 
hours post infection. See FIGURE 26C. 

Methods of purification of inactivated virus are known in the art and may include one or 
more of, for instance gradient centrifugation, ultracentrifugation, continuous-flow 
ultracentrifugation and chromatography, such as ion exchange chromatography, size exclusion 
0 chromatography, and liquid affinity chromatography. Additional method of purification include 
ultrafiltration and dialfiltration. See JP Gregersen ‘TIerstellung von Virussimpfstoffen aus 
Zellkulturen” Chapter 4.2 in Phatmazeutische Biotecnologie (eds. O. Kayser and RH MueUer). 
Wissenschaftliche Verlagsgesellschaft, Stuttgart, 2000. See also, O’Neil et al., “Virus 

Harvesting and Affinity Based liquid Chroinatography. A Method for Virus Concentration and ■ 

5 Purification”, Biotechnology (1993) 11:173-177; Prior era/., “Process Development for 

Manufacture of Inactivated mV-1”, Pharmaceutical Technology (1995) 30-52; and Majhdi etai, 
“Isolation and Characterization of a Coronaviius from Elk Calves with diarrhea” Journal of 
Clinical Microbiology (1995) 35(11): 2937-2942. 

Other examples of purification methods suitable for use in the invention include 
> polyethylene glycol or ammonium sulface precipitation (see Trepanier etoL, “Concentration of 

human respiratory syncytial virus using ammonium sulfate, polyethylene glycol or hollow fiber 
ultrafiltration” Journal of Virological Methods (1981) 3(4):201-211; Hagen etoL, “Optimization 
of Poly(ethylene glycol) Precipitation of Hepatitis Virus Used to prepare VAQTA, a Highly 
Purified Inactivated Vaccine” Biotechnology Progress (1996) 12:406412; and Carlsson et al., 
“Purification of Infectious Pancreatic Necrosis Virus by Anion Exchange Chromatography 
fiicreases the Specific Mectivity” Journal of Virological Methods (1994) 47:27-36) as well as 
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ultrafiltration and microfiltration {see Pay et al. , Developments in Biological Standardization 
(1985) 60: 171-174; Tsurumi et aL, “Structure and filtration performances of improved 
cuprammonium regenerated cellulose hollow fibre (improved BMM hollow fibre) for virus 
removal” Polymer Journal (1990) 22(12):1085-1100; and Makino et al., “Concentration of live 
5 retrovirus with a regenerated cellulose hollow fibre, BMM”, Archives of Virology (1994) 139(1- 
2):87-96.). 

Preferably, the virus is purified using chromatography, such as ion exchange 
chromatogr^hy. Chromatic purification allows for the production of large volumes of virus 
containing suspension. The viral product of interest can interact with the chromatic medium by a 
10 simple adsorpdon/desorption mechanism, and large volumes of sample can be processed in a 
single load. Contaminants which do not have affinity for die adsorbent pass through the column. 
The virus material can then be eluted in concentrated fo rm 

Preferred anion exchange resins for use in the invention include DEAE, EMD TMAE, 
Preferred cation exchange resins may comprise a sulfonic acid-modified surface. In oiie 
15 embodiment, the virus is purified using ion exchange chromatography comprising a strong anion 

exchange resin {e.g. EMD TMAE) for the first step and EMD-SO 3 (cation exchange resin) for 
the second step. A metal-binding affinity chromatography stq> can optionally be included for 
further purification. (See, e.g., WO 97/06243). 

A preferred resin for use in the invention is Fractogel™ EMD. This synthetic methacrylate 
20 based resin has long, linear polymer chains (so-called "tentacles") covalently attached. Tliis 

“tentacle chemistry” allows for a large amount of sterically accessible ligands for the binding of 
biomolecules without any steric hindrance. This resin also has improved pressure stability. 

Column-based liquid affinity chromatogr^hy is another preferred purification method for 
use in the invention. One example of a resin for use in this purification method is Matrex™ 

25 Cellufine™ Sulfate (MCS). MCS consists of a rigid spherical (approx. 45-105 /tm diameter) 
cellulose matrix of 3,000 Dalton exclusion limit (its pore structure excludes macromolecules), 
with a low concentration of sulfate ester functionality on the 6-position of cellulose. As the 
functional ligand (sulfate ester) is relatively highly dispersed, it presents insufficient cationic 
charge density to allow for most soluble proteins to adsorb onto the bead surface. Therefore the 
10 bulk of the protein found in typical virus pools (cell culture supernatants, e.g. pyrogens and most 
contaminating proteins, as weD as nucleic acids and endotoxins) are washed from the column 
and a degree of purification of the bound virus is achieved. 

The rigid, high-strength beads of MCS tend to resist compression. The pressure/flow 
characteristics the MCS resin permit high linear flow rates allowing high-speed processing, even 
i5 in large colunms, making it an easily scalable unit operation. In addition a chromatographic 
purification step with MCS provides increased assurance of safety and product sterility, avoiding 
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excessive product handling and safety concerns. As endotoxins do not bind to it, the MCS 
purification step allows a rapid and contaminant free depyrogenation. Gentle binding and 
elution conditions provide hi^ edacity and product yield. TTie MCS resin therefore represents 
a simple, r^id, effective, and cost-saving means for concaitration, purification and 
5 depyrogenation. In addition, MCS resins can be reused repeatedly. 

The inactivated virus may be further purified by gradirat centrifugation, preferably density 
gradient centrifugation. For commercial scale operation a continuous flow sucrose gradient 
centrifugation would be the preferred option. This method is widely used to purify antiviral 
vaccines and is known to the expert in the field (See JP Gregersen “HersteUung von 
10 Virussimpfstoffen aus Zellkulturen” Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 

Kayser and RH Mueller) Wissenschaftliche Verlagsgesellschaft, Stuttgart, 2000.) 

The density gradient centrifugation step may be performed using laboratory or conunercial 
scale gradient centrifugation equipment For example, a swinging bucket rotor, a fixed angle 
rotor, or a vertical tube rotor, particularly for laboratory scale production of the virus. • 

15 Preferably, the gradient centrifugation step is performed using a swinging bucket rotor. This 

type of rotor has a sufficiently long pathlength to provide high quaUly separations, particularly 
with multicomponent samples. In addition, swinging bucket rotors have greatly reduced wall 
effects, and the contents do not reorient during acceleration and deceleration. Because of their 
longer pathleigth, separations take longer compared to fixed angle or vertical tube rotors. The 
to prepared sucrosO solutions are controlled via refractometer on their sucrose concentration. 

Sucrose gradirats for density gradient centrifugation, such as. in a swinging bucket 
caitrifuge tubes may be farmed prior to centrifugation by the use of a gradient former 
(continuous/linear). The volume of sample which can be applied to the gradient in a swinging 
bucket rotor tube is a function of the cross-sectional area of the gradient that is exposed to the 
;5 sample. If the sample volume is too high, there is not sufficient radial distance in the centrifuge 
tube for effective s^aration of components in a multicomponent s amp li* 

An approximate sample volume for swinging bucket rotor SW 28 is 1-5 ml per tube (with a 
tube diameter of 2.54 cm). The sample is ^plied to tiie gradient by pipetting die volume on top 
of the gradient The blunt end of the pipette is placed at 45-60® angle to the tube wall, 

0 ^iproximately 2-3 mm above the gradient The sample is injected slowly and allowed to run 
down the waU of the tube onto the gradient After centrifugation gradient fractions are recovered 
by carefully inserting a gauge needle until the bottom of the tube and starting to collect fractions 
of 2 ml by pumping the liquid from the tube into falcon tubes. 

Sucrose density gradients suitable for u^ with this density gradient centrifugation 
5 purification step include 0 - 60%, 5 - 60%, 15 - 60%, 0 - 50%, 5 - 50%, 15 - 50%. 0 - 40%, 5 
- 40%, and 15 - 40%. Preferably, the sucrose density gradient is 15 - 40%, 5 - 40% or 0 - 40%. 
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Alternatively, a discontinuous sucrose density gradient may be used for purification. A 
discontinuous sucrose density scheme provides for discrete, overiaying layers of difFra-ing 
sucrose concraitrations. In one exanqple, a first layer of 50% sucrose is covered by a second 
layer of 40% sucrose; die second layer is covraed by a third layer of 20% sucrose; the third layer 
5 is covered by a fourth layer of 10% sucrose; and the fourth layer is covered by the solution 
containing the virus to be purified. 

Ih one embodiment, inactivated virus is purified by a method com prising a first step of 
chromatography purification and a second step of gradient centrifugation. Preferably the first 
step comprises liquid affinity chromatography, such as MCS. Preferably, the second step 
10 conqmses density gradient centrifugation using a swin g ing bucket rotor. 

Additional purification methods which may be used to purify inactivated SARS virus 
include the use of a nucleic acid degradmg agent, preferably a nucleic acid degrading enzyme, 
such as a nuclease having DNase and RNase activity, or an endonuclease, such as from Serratia 

I 

marcescenSf commerdaUy available as Benzonase™, membrane adsorbers with anionic 
15 fimctional groups {e.g. Sartobind™) or additional chromatographic steps with anionic functional ' 
groups (e.g. DEAE or TMAE). An ultrafiltration/dialfiltration and final sterile filtration step 
could also be added to the purification method. 

Preferably, the purification includes treatment of the SARS viral isolate with one or more 
nucleic add degrading enzymes. These enzymes may be used to reduce the level of host cell 
iO nucleic add in the viral purific^on process. Nucleic acid digesting enzymes for use in cell 
culture are known in the art and include, for example, Benzonase™. 

The treatment of the virus with the nucldc add degrading enzyme and inactivating agent 
can be performed by a sequential treatment or in a combined or simultaneous manner. 

Preferably, the nucleic acid degrading agent is added to the virus preparation prior to the addition 
15 of the inactivating agent. 

The purified viral preparation of the invention is substantially free of contaminating 
proteins derived from the ceDs or cell culture and preferably comprises less than about 1000, 

500. 250. 150. 100, or 50 pg cellular nucleic acid / fig virus antigen, preferably less than about 

1000. 500. 250. 150. 100, or 50 pg cellular nucleic acid/ dose. Still more preferably, the purified 
iO viral prq>aration comprises less than about 20 pg, and even more preferably, less than about 10 

pg. Methods of measuring host cell nucleic add levels in a viral sample are known in the art 
Standardized methods q>proved or recommended by regulatory authorities such as the WHO or 
the EDA are preferred. 

The invention includes an inactivated vaedne composition comprising a prophylacticaUy 
>5 effective amount of SARS viral antigen, preferably spike or an immunogenic fra gme nt thereof. 
The SARS viral antigen is preferably present in a concentration amount of 0.1 to 50 fig 
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antigCT/dose, more preferably 03 to 30 pg antigen/dose. Still more preferably, the antigen is 
about IS pg/dose. 

In one embodiment, a Iowct concentration of SARS viral antigen is used in inactivated 
vaccine compositions of the invention. Such lower concentration vaccines may optionally 
5 con^iise an adjuvant to boost the host immune response to the antigen, ha such a “low dose” 
vaccine, the SARS viral antigen is preferably present in a concentration of less than 15 fig 
antigen/dose, (Le., less than 10, 7.5, 5 or 3 pg antigen/dose. 

The inactivated vaccine preparations of the inveaition may further comprise a stabilizer to 
preserve the integrity of the immunogenic proteins in the inactivated viral preparation. 

10 Stabilizers suitable for use in vaccines are known in the art and may include, for example, 

buffers, sugars, sugar alcohols, and amino acids. Stabilizing buffers are preferably actuated to a 
physiological pH range and may include phosphate buffers, Tris buffers, TE (Tris/EDTA), TEN 
fTris/NaCl/EDTA) and Earle s salt solution. Stabili 2 dng sugars may include, for example^ one or 
more of saccharose, glucose, fructose, dextranes, dextranesulphate, and trehalose. Stabilizing 
15 sugar alcohols may include, for example, Xylite/Xylitole, Mannite/Mannitol, Sorbite/Sorbitol, 
and Glycerol. Amino adds suitable for use in the invention include, for example, L-glutamine, 
arginine, cysteine, and lysine. Additional stabilizers which may be used in the invention include 
Tartaric add, Pluronic F 68, and Tween 80. 

SARS viral isolates which may be used for the inactivated viral preparations of the ' 
to invention may be obtained and identified by any of the mechanisms described supra. For 

exanqile, a SARS isolate may be obtained from a clinical sample and plaque purified. Such 

methods of viral isolation are known in the art 

Further purification procedure can be ^lied to ensure the seed virus used for preparation 
of the vaccine does not contain, for example, unwanted adventitious agents. In one embodiment 
:5 viral RNA from the viral isolate can be isolated from the virus, purified (and, optionally, the 
sequence verified through PCR or other means) and then introduced into a suitable cell culture. 

As an exainple of this technique, a clinical viral sample is plaque purified and amplified on 
vero cells to genraate a sufficient amount of the viral sample for analysis. Cellular remnants are 
then cleared from the supernatant by centrifugation. The virus can thai be pelleted by 
0 ultracentrifugation and the pellet resuspended in PBS. After further centrifugation purification, 
the virus containing fraction is treated with a DNase (and optionally also an RNase)' Viral RNA 
is then isolated from this fiaction and transfected into a host cell. 

Exan^les 2 and 3 provide an illustration of purification of inactivated whole SARS virus 
using MCS chromatography resin purification followed by density gradient ultracentrifugation. 
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Routes and methods of immunization of the vaccines of the invmition are disrnssf^d in 
more derail in a section below. Examples 4 and S provide illustrations of a mouse immunization 
scheme with the inactivated SARS virus of the invention. 

B. Attenuated SAKS Vaccines 

5 The invention includes a con^rosition comprising an attenuated SARS virus. This 

composition can be used as a prophylactic or therapeutic SARS virus vaccine. Methods of 
attenuating viruses are known in the art Such methods include serial passage of the SARS virus 
in cultured cells (e.g., mammalian cell culture, preferably fetal rhesus kidney cells or VERO 
cells-see the discussion in Section A above regarding culture of SARS virus), until the SARS 
10 virus demonstrates attenuated function. The temperature at which the virus is grown can be any 
temperature at which with tissue culture passage attenuation occurs. Attenuated function of the 
SARS virus after one or more passages in cell culture can be measured by one skilled in the art 
As used herein, attenuation refers to the decreased virulence of the SARS virus in a human 
subject Evidence of attenuated function may be indicated by decreased levels of viral 
15 repMc^on or by decreased virulence in an animal model. 

Other methods of producing an attenuated SARS virus include passa^ of the virus in cell 
culture at sub-optimal or “cold” temperatures and introduction of attenuating mutations into the 
SARS viral genome by random mutagenesis (e.g., chemical mutagenesis) or site specific directed 
mutagenesis. Preparation and generation of attenuated RSV vaccines (the methods of which will 
10 generally ^licable to SARS virus) are disclosed in, for example, EP 0 640 128, US Patent No. 

6,284,254, US Patent No. 5,922,326, US Patent No. 5,882,651. 

Ihe attenuated derivatives of SARS virus are produced in several ways, such as for • 
example, by introduction of temperature sensitive-mutations either with or without chemical 
mutagenesis (e.g., 5-fluorouracil), by passage in culture ait “cold” temperatures. Such cold 
15 adaptation includes passage at temperatures between about 20“C to about 32°C, and preferably 
betwewi tempCTatures of about 22°C to about 30°C, and most preferably between temperatures 
of about 24°C and 28°C. TTie cold adaptation or attenuation may be performed by passage at 
increasingly reduced temperatures to introduce additional growth restriction mutations. Hie 
number of passages required to obtain safe, immimizing attenuated virus is dependent at least in 
!0 part on the conditions employed. Periodic testing of the SARS virus culture for virulence and 
immunizi ng ability in animals (e.g., mouse, primate) can readily determine the parameters for a 
particular combination of tissue culture and temperature. Hie attenuated vaccine will typically be 
formulated in a dose of from about 10^ to 10^ PFU or TCID50, or more for maximal efficacy. 

Attenuated virus vaccines for SARS-CoV also are produced by creating virus chimeras 
i5 comprising sequences derived from at least two different coronaviruses, one of which is a SARS- 

CoV . For example, a virus chimera is produced that comprises nonstructural protein encoding 
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genes derived from a first coronavirus (e.g., murine, bovine, porcine, canine, feline, avian 
coronavirus) and one or more structural protein encoding gaies {e.g., spike, E, fiom a SARS- 
CoV. Alternatively, the virus chimera may comprise sequences derived from a h uman 
coronavirus that is not a SARS-CoV OC43, 229E) together with sequences fiom a SARS- 

5 CoV . Chimeric coronaviruses of the presait invention are generated by a variety of methods, 

including for example allowing for natural RNA recombination in a eukaryotic {e.g., 
mammalian) cell that contains RNA from each of the parental coronaviruses {e.g., following 
infection) or by using standard molecular biology techniques known to those of skill in the art to 
engineer desired virus chimeras (or portions thereof) as cDNA clones, which may then be used to 
10 produce infectious virus (see for example, US 6593111 B2; Yount et ed., 2003, Proc. Natl. Acad. 
ScL USA 100(22):12995-13000). An attenuated phenotype of the coronavirus chimeras 
described herein can be readily measured by one of skUl in the art. 

Attenuated viruses can be also generated by deleting one or more open reading fiames 
(ORFs) that are not essential for viral replication. Preferably, these deletions occur in the • 

.5 structural region of the genome, such as ORF 3a, 3b, 6, 7a, 7b, 8a, 8b, 9b. See e.g., Haijema BJ, 
Volders H, Rottier PJ. J Virol. (2004) 78(8):3863-71; and de Haan, C. A., P. S. Masters, X. Shen, 
S. Weiss, and P. J. Rottia-, “The group-specific murine coronavirus genes are not essential, but 
their deletion, by reverse genetics, is attenuating in the natural host.” Virology (2002) 296: 177- 
189. Deletion of such regions within a coronavirus such as SARS can be achieved, for example, 

:0 by revCTse genetics or "targeted recombination" (See, e.g.. Masters, P. S., “Reverse genetics of 

the largest RNA viruses”, Adv. Virus Res. (1999) 53:245-264. ■ 

Methods of purification of attenuated virus are known in the art and may include one or . 
more of, for instance gradient centrifugation and chromatography. See Giegersen “Herstellung 
von Virussimpfstoffen aus Zellkulturen” Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 

5 Kayser and RH Mueller) WissenschafUiche Veriagsgesellschaft, Stuttgart, 2000. 

_C Split SARS Vaccines 

The invention includes a composition comprising a split SARS virus formulation and 
methods for the manufacture thereof. This composition can be used as a prophylactic or 
therapeutic SARS virus vaccine. 

0 Methods of splitting enveloped viruses are known in the art. Metiiods of splitting 

enveloped viruses are disclosed, for example, in WO 02/28422, incorporated herein by reference 
in its entirety, and specifically including the splitting agents and methods described therein. 
Methods of splitting influenza viruses are disclosed, for example, in WO 02/067983, WO 
02/074336, and WO 01/21151, each of which is incorporated herein by reference in its entirety. 

5 The splitting of the virus is carried out by dismpting or ftagmenting whole virus, infectious 

(wild-type or attenuated) or non-infecdous (for example inactivated), with a dismptirlg 
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concentration of a splitting agent The disruption results in a full or partial solubilisation of the 
virus protdns, altering the integrity of the virus. 

Prefeably, the splitting agent is a non-ionic or an ionic surfactant Accordingly, die split 
SARS virus formulations of the invention may also comprise at least one non-ionic surfactant or 
5 detergent Examples of splitting agents usefid in the invention include: bile acids and derivatives 
thereof, non-ionic surfactants, alkylglycosides or alkylthioglycosides and derivatives thereof, 
acyl sugars, sulphobetaines, betains, polyoxyethylenealkylethers, NJ^-dialkyl-Glucamides, 
Hecameg, aUcylphenoxypolyethoxyethanols, quaternary ammonium compounds, saicosyl, CTAB 
(cetyl tiimetbyl ammonium bromide) or Cetavlon. 

10 Preferably, the ionic surfactant is a cationic detergent Cationic detergents suitable for use 

in the invention include detergents comprising a compound of the following formula: 

X" 

wherein 

Ri, R2 and R3 are the same or different and each signifies alkyl or aryl, or 
15 Ri and R2, together with the nitrogen atom to which these are attached form a 5- or 6- 

membered heterocyclic ring, and 

R3 signifies alkyl or aryl, or 

Ri, R2 and R3 together with the nitrogen atom to which these are attached, signify a 5- or 6- 
membered heterocyclic ring, unsaturated at the nitrogen atom, 

10 R4 signfies alkyl or aryl, and 

X signifies an anion. 

Examples of such cationic detergents are cetyltiimethylanimonium salts, such as 
ceytltrimethylammonium bromide (CTAB) and myristyltrimethylammonium salt 

Additional cationic deter^ts suitable for use in the invention include lipofectine, 

15 lipofectamine, and DOT-MA. 

Non-ionic surfact^ts suitable for use in the invention include one or more selected from 
the group consisting of the octyl- or nonylphenoxy polyoxyethanols (for example the 
commercially available Triton series), polyoxyethylene soibitan esters (Tween s^es) and 
polyoxyethylene ethers or esters of the ^heral f ormula : 

>0 0(CH2CH20)„-A-R 

wherein n is 1-50, A is a bond or — C(0)-, R is C1.50 alkyl or phenyl C1.50 alkyl; and 
combinations of two or more of these. 

The invention comprises a method of preparing a split SARS virus comprising contacting 
the SARS virus with a sufficient amount of splitting agent to disrupt the viral envelope. The loss 
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of integrity afto splitting renders the virus non-infectious. Once the disrupted viral oivelope 
proteins are generally no longer associated with whole intact virions, other viral proteins are 
preferably fully or partially solubilized and are dierefore not associated, or only in part 
associated, with whole intact virions after splitting. 

5 Hie method of pr^aring a split SARS virus may furdier comprise removal of die splitting 

agents and some or most of the viral lipid material. The process may also include a number of 
different filtration and/or othra- separation stqis such as ultracentrifugation, ultrafiltration, zonal 
centrifugation and chromatographic steps in a variety of combinations. Hie process ma y also 
optionally include an inactivation step (as described above) which may be carried out before or 
10 after the splitting. The splitting process may be carried out as a batch, continuous, or semi- 
continuous process. 

Split SARS virus vaccines of the invention may include structual proteins, mt»mhraiip. 
fragments and membrane envelope proteins. Preferably, the split SARS virus preparations of the 
invention comprise at least half of the viral structural proteins. 

15 One example of a method of preparing a split SARS vims formulation includes the 

following steps: 

(i) propagation of the SARS virus in cell culture, such as MRC-5 cells (ATCC CCLrYll), 
WI-38 cells (ATCC CCL-75), fetal rhesus kidney cells or veto cells (See the discussion in 
Section A, above, regarding culture of SARS virus); 

20 (ii) harvesting of SARS virus-containing material from the ceD culture; 

(iii) clarification of the harvested material to remove non-SARS virus material; 

(iv) concentration of the harvested SARS virus; 

(v) s^aration of die whole SARS virus ftom non-virus materi al- 

(vi) splitting of the whole SARS virus using a suitable splitting agent in a density gradient 
25 centrifugation step; and 

(vii) filtration to remove undesired materials . 

The above steps are preferably performed sequentially. 

The clarification step is preferably performed by centrifugation at a mnder ate speed. 
Altranatively, a filtration stq> may be used for example with a 0.2^m membrane. 

10 The concCTtration step may preferably onploy an adsorption method, for instance, using 

CaHP 04 . Alternatively, filtration may be used, for example ultrafiltration. 

A further separation step may also be used in the method of the invention. This further 
separation step is preferably a zonal centrifugation separation, and may optionally use a sucrose 
gradient The sucrose gradient may further comprise a presravative to prevent microbial growth. 
15 The splitting step may also be performed in a sucrose gradient wherein the sucrose 

gradient contains the splitting agent 
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The method may further comprise a sterile filtration step, optionally at the end of the 
process. Preferably, there is an inactivation step prior to the final filtration step. 

Methods of preparing split S ARS virus formulations may further include treatment of the 
viral formulation with a DNA digesting enzyme. These enzymes may be used to reduce the level 
5 of host cell DNA in the viral purification process. DNA digesting enzymes for use in cell culture 
are known in the art and include, for example, Benzonase®. 

Treatment of the SARS virus formulation with a DNA digesting enzyme may occur at any 
time in the purification and splitting process. Preferably, however, the SARS virus formulation 
is treated with a DNA digesting enzyme prior to use of a deferent Still more preferably, the 
0 SARS virus formulation is treated with a DNA digesting enzyme, such as Braizonas, prior to 
treatment with a cationic detergent, such as CTAB. 

Methods of purification of split virus are known in the art See JP Gregasen “Herstellung 
von Vimssimpfstoffen aus Zellkulturen” Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 
Kayser and RH Mueller) WissenschafUiche Yerlagsgesellschaft, Stuttgart, 2000. 

5 The invention includes a split vaccine composition comprising a prophylactically effective 

amount of SARS viral antigen, preferably spike or an immunogenic fragment thereof. The 
SARS viral antigen is preferably present in a concentration amount of 0. 1 to 50 pg antigen/dose, 
more preferably 0.3 to 30 /rg antigen/dose. Still more preferably, the antigen is about 15 
/ig/dose. 

0 In one embodiment, a lower concentration of SARS viral antigen is used in split vaccine 

compositions of the invention. Such lower concentration vaccines may optionally comprise an 
adjuvant to boost the host immune response to the antigen. In such a “low dose” vaccine; the 
SARS viral antigen is preferably present in a concentration of less than 15 /ig antigen/dose, (i.e., 
less than 10, 7.5, 5 or 3 pg antigen/dose. 

5 D. Subunit SARS Vaccines 

The invention includes a composition comprising an isolated or purified SARS viral 
antigen or a derivative thereof. The composition may further comprise one or more adjuvants. 

SARS viral antigrais can be isolated or purified from a SARS virus grown in cell culture. 
Alternatively, SARS viral antigens can be recombinantly produced by methods known in the art 
0 The SARS viral antigens used in the invention can be produced in a variety of different 

expression s3/stems which are known in the art; for example those used with mammalian cells, 
baculoviruses, bacteria, and yeast Such expression systems will typically use polynucleotides 
encoding die viral antigens of the invention. Such sequences can be obtained using standard 
techmques of molecular biology, including translating the amino acid sequences listed herein. 

5 Accordingly, the invention includes polynucleotides encoding for the viral antigens of the 
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invesntion. In addition, the viral antigens of the invention can be produced (at least in part, 
preferably in whole) via synthetic chemistry methods. 

Insect cell expression systens, such as baculovirus systems, are known to those of skill in 
the art and described in, e.g., Summras and Smith, Texas Agricultural Ejq>eriment Station 
5 Bulletin No. 1555 (1987). Materials and methods for baculovirus/insert cell expression systems 
are commercially available in kit form from, inter alia, Invitrogen, San Diego CA. Similarly, 
bacterial and mammalian cell expression systems are also known in the art and described in, e.g.. 
Yeast Genetic Engineering (Barr et al., eds., 1989) Butterworths, London. 

A number of appropriate host cells for use with the above systems are also known. For 
10 example, mammalian cell lines are known in the art and include immortalized cell lines available 
fiom the American Type Culture Collection (ATCC), such as, but not limited to, Chinese 
hamster ovary (CHO) cells, HeLa ceUs, baby hamster kidney (BHK) cells, monkey kidney ceUs 
{e.g.. Hep G2), Madin-Darby bovine kidney (“MDBK”) cells, as well as others. Mammalian 
sources of cells include, but are not limited to, human or non-human primate (e.g., MRC-5 
5 (ATCC CCT^ni), WI-38 (ATCC CCL-75), fetd rhesus lung cells (ATCC CL-160), human 
embryonic kidney cells (293 cells, typically transformed by sheared adenovirus type 5 DNA), 
VERO ceUs from monkey kidneys), horse, cow {e.g., MDBK cells), sheep, dog (e.g., MDCK 
cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, deposit number DSM 
ACC 2219 as described in WO 97/37001), cat, and rodent (e.g., hamster cells such as BHK21-F, 
0 HKCC cells, or Chinese hamster ovary ceUs (CHO cells)), and may be obtained from a wide 

variety of developmental stages, including for example, adult, neonatal, fetal, and embryo. 

Sunilarly, bacterial hosts such as E. coli. Bacillus subtilis, and Streptococcus spp., will find- 
use with the present expression constructs. Yeast hosts useful in the present invention include, 
inter alia, Saccharomyces cerevisiae, Candida albicans, Candida maltosa, Hansenual 
5 polymorpha, Kluyveromycesfragilis, Kluyveromyces lactis, Pichia guillerimondii, Pichia 
pastoris, Schizosqccharomyces pombe and Yarrowia lipolytica. Insect cells for use with 
baculovirus expression vectors include, inter alia, Aedes aegypH, Autographa califomica, 
Bombyxmori, Drosophila melanogaster, Spodoptera frugiperda, and Trichoplusia ni. 

Nucleic add molecules comprising nucleotide sequences of the viral antigens or antibodies 
) of the invention can be stably integrated into a host cell genome or maintained on a stable 

episomal element in a suitable host cell using various gene delivery techniques well known in the 
art See., e.g., US Patent No. 5,399346. 

Depending on the expression system and host selected, the molecules are produced by 
growing host cells transformed by an expression vector under conditions whereby the protein is 

) expressed. Ibe expressed protein is then isolated from the host cells and purified, ffflie 

expression system secretes the protein into growth media, the product can be purified directly 
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„ from the media, ff it is not secreted, it can be isolated from cell lysates. The selection of the 
q>propriate growth conditions and recovery methods are within the sldll of the art 

ITie invention includes a composition conqnising an isolated or purified SARS viral 
antigen or a derivative thereof. The invention also includes a composition comprising at least 
5 two isolated or purified SARS viral antigens or derivatives theceof, which have been co-purified 

or purified separately and then combined, hi one embodiment, the SARS viral antigen is a spike 
(S) protein, hi yet another embodiment, the SARS viral antigen is a nucleocapsid (N) protein, a 
membrane (M) glycoprotein, or an envelope (E) protein. Preferably, the SARS viral antigen is 
present in the composition in a purity greater than 75% (e.g., 78%, 80%, 82%, 85%, 88%, 90%, 
10 92%, 95%, 98%). 

The invention includes a vaccine composition comprising a prophylactically effective 
amount of SARS viral antigen, preferably spike or an immunogenic fiagment thereof. The 
SARS viral anti^n is prefraably present in a concentration amount of 0.1 to 50 ng antigen/dose, 
more preferably 0.3 to 30 /ig antigen/dose. Still more preferably, the antigen is about 15 
15 /tg/dose. 

In one embodiment, a lower concentration of SARS viral antigen is used in vaccine 
compositions of the invention. Such lower concentration vaccines may optionally comprise an 
adjuvant to boost the host immune response to the antigen. In such a “low dose” vaccine, the 
SARS viral antigen is preferably present in a concentration of less than 15 /ig antigen/dose, (Le., 
iO less than 10, 7.5, 5 or 3 /ig antigen/dose. 

The following example illustrates a method of preparing a SARS virus spike (S) protein 
subunit vaccine. 

SARS virus S antigen may be isolated and purified from a variety of sources and nting a 
variety of methods, including, but not limited to, S antigen expressed in cultured eukaryotic cells 
rn a mmalian cells, such as VERO, CHO) or bacteria (e.g., E. coli). Expression of may be 
achieved by a variety of means, such as, for example, from SARS virus infected cell culture or 
cell culture supernatants, from cultured cells stably transformed with a DNA expression cassette 
encoding the SARS virus S protdn (e.g., RNA polymerase II promoter operably linkofl to a 
SARS virus S gene), or from cultured cells infected with a replication-competent or replication- 
0 incompetent virus-based expression vector (e.g., adenovirus vector, poxvirus vector, alphavirus 
vector, retrovirus vector) encoding the SARS virus S protein, as a means to eliminatp. the need to 
work with infectious SARS virus. 

1. Subunit SARS Vaccines Produced from SARS Virus Cultures 

The SARS virus may be grown in cultured mammalian celle, such as VHRO cells, then 
5 separated from the cultured cells. A SARS viral antigen, such as the S protein, can then be 

solubilized and sq>arated from the SARS virus, and further isolated and purified. 
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Iq one example, the SARS vims may be produced as described in the Inactivated SARS 
vaccine examples, then the desired SARS antigen, such as spike protein, may be further purified 
from the end product using techniques known in the art 

In another example, a SARS subunit vaccine may be produced as follows. SARS vims 
5 may be produced using a desired mammalian cell line on microcairier beads in large, controlled 
fermentors. For example, vaccine quality African Green Monkey kidney cells (VERO cells) at a 
concentration of 10^ cells/mL are added to 60lo 75 L of CMRL 1969 media, pH 7.2, in a ISO L 
bioreactor containing 360 g of Cytodex-1 microcartier beads and stirred for 2 hours. Additional 
CMRL 1969 is added to give a total volume of 150 L Fetal bovine serum (PBS) is to a 
10 final concentration of 3.5%. Glucose is added to a final concentration of 3.0 g/L and glutamine 
is added to a final concentration of 0.6 g/L Dissolved oxygrai, pH, agitation and temperature are 
controlled, and cell growth, glucose, lactate and glutamine levels are monitored. When cells are 
in logarithmic phases usually on days 3 to 4 reached a density of about 1.0-2.5x10® cells/mL, the 
culture medium is drained from the fermenter and 120 L of CMRL 1969, pH 7.2 (no FBS) is 
15 added and the culture stirred for 10 minutes. The draining and filling of the fermenter is usually 
repeated once but could be repeated up to three times. Afto: washing the cells, the fermenter is 
drained and 50 L of CMRL 1969 containing 0.1% (v/v) FBS is added. The SARS virus 
inoculum is added at a multiplicity of infection (m.o.i.) of 0.001 to 0.01. Trypsin may be added 
to promote efficient infection. Additional CMRL 1969 with 0.1% FBS is added to give a final 
20 volume of 150 L. Incubation is continued at 34 C. One viral harvest is obtained from a single 
fermenter lot, typically at 2-7 days post-infection. Multiple harvests fix>m a single fermentation 
may also be obtained. 

The isolation and purification of S protein may be effected by a variety of tnftang ^ 
described below. For example, collecting S protein-containing flow-throu^ from ion exchange 
25 chromatography of solubilized SARS virus envelope proteins; loading the flow through onto a 
hydroxyapatite matrix, and selectively eluting the S protein from the hydroxyapatite matrix. The 
selectively eluted S protein may be further concentrated by tangential flow ultrafiltration. 

Alternatively, the isolation and purification may be effected by collecting S protein- 
containing flow-through from ion exchange chromatography of the solubilized SARS virus 
JO envelope proteins; loading the flow through onto a hydroxyapatite matrix and collecting an S 
protein-containing flow throu^, selectively removing detergent used in the solubilization step 
from the hydroxyapatite matrix flow through to provide isolated and purified S protein. The 
isolated and purified S protein may be subsequently concentrated by tangential flow 
ultrafiltration 
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Nucleic acid contaminants may be removed firom the isolated and purified S protein by 
treatment with a nucleic acid degrading agent as described above in the Inactivation section. 
Preferably, the nucleic acid degrading agent is a nuclease, such as for example. Benzonase. 

The isolated and purified S protein may be applied to a gel filtration medium and the S 
5 protein subsequently coUected therefrom to separate the S protein fiom contaminants of other 
molecular weights. 

Alternatively, the isolation and purification may be effected by loading S protein on a first 
ron-exchange medium while permitting contaminants to pass through the medium, eluting the S 
protein fiom the first ion-exchange medium, to separate the S protein fiom contaminants of other 
10 molecular weights. The eluted S protein is applied to a second ion-exchange medium while 
allowing contaminants to pass through the second ion-exchange medium. The S protein is 
subsequentiy eluted therefiom. to provide the isolated and purified S protein. The eluted S 
protein may be. concentrated by tangential flow ultrafiltration. 

Alternatively, substantiaUy pure SAKS virus S protein suitable for use as an immunogen in 
5 a subunit vaccine formulation may be prepared from infected ceU lysates, such as for example 
using a non-denaturing detergent buffer containing 1% Triton X-100 and deoxycholate to lyse 
infected ceUs. The cell lysates are clarified by centrifugation and S protein is purified from the 
ceU lysates by immunoaffinity purification. A monoclonal antibody against the S protein is 
generated and coupled to beads and a column is constructed with those beads. SARS-infected 
0 cell lysates are applied to the column, and the column is washed with PBS containing 0.1% 

Triton X-100. Protein bound to the column is eluted with O.IM glycine, pH 2.5, 0.1% Triton X- 
100. Elution s^les are buffered, such as for example, with Tris, and analyzed for the presence 

of protein. Fractions containing the protein are pooled and dialyzed against PBS 

As discussed above, the present invention includes isolated and purified S protein of SARS 
5 vims. In one example, the virus is grown on a vaccine quality cell line, such as VERO cells, and 
the grown virus is harvested. The virus harvest is filtered and then concentrated typically using 
tangential flow ultrafiltration using a membrane of desired molecular weight cut-off and 
diafiltered. The virus harvest concentrate may be centrifuged and the supernatant discarded. The 
peUet from the centrifugation then is detergent extracted to solubilize the S protein, for example, 

) by resuspending the pellet to the original harvest concentrate volume in an extraction buffer 
containing a detergent such as a non-ionic detergent including TRITON X-100. 

FoUowing centrifugation to remove non-soluble proteins, the S protein extract is purified 
by chromatographic procedures. The extract may first be applied to an ion exchange 
chromatography column such as a TMAE-fractogel or S-fiactogel column equilibrated to permit 
the S protein to flow through while impurities are retained on the column. 
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Next, Ite flow through may be loaded onto a hydroxyapatite eolumn, equilihrated to pennit 

bmdmg of the S proteiu to the matrix and to permit contaminants to pass ftom the coinnm. The 

bound S protein is then eluted fiom the column by a suitable eintant The lesnlting purified 

solution of S pmtein may be further pnxtessed to inerease its purity. The eluate first may be 

5 concentrated by tangential flow ultrafiltmtion using a membrane of deshed molecular weight 

cut-off. The filtrate may be contacted with a polyethylene glycol of desired molecular weight, 

for example, about 6000 to 8000, to ptecipitate the protein. Following oeotriftigation and discard 

of the supernatant, the peUet may be resuspended in PBS and dialyzed to remove the 

polyethylene glycol. Finafly, the dialyzed sointion of S protein may be sterile filtered. The 

.0 sterile filtered solution may be adsothed onto alum. The polyethylene glycol precipitation and 

resuspenston purification step may be effected at an earlier stage of the purification operation if 
desired: 

Alternatively, SAKS virus is recovered following growth and harvesting of the virus, and a 
concentrate obtained such as, for example using PEG precipitation or tangenfial flow filtration. 

The vnus IS contacted with detergent to solubilize the S proteins. FoUowing centrifugation, the 

supernatant is recovered to further purification of the S protein and the non-soluble proteins 
discarded. 

The supernatant is applied to an ion exchange chromatography column, such as a TMAE- 
ftactogel or S-fiactogel column, suitably equilibrated to permit retention of the S protein on the ' 

) column. The S protein is eluted from the ion-exchange column under suitable conditions The 

eluate then may be passed through a gel filtration column, such as a Sephacryl S-300 column, to 

separate the S protein fiom contaminants of other molecular weights. A hydroxyapahte column 

may be employed in place of the Sephacryl column. 

The S protein may be eluted from the column to provide a purified solution of S protein 
The eluate may be concentrated by tangential flow ultrafiltration using a membrane of desired 
molecular weight cut-off. The concentrated S protein solution then may be sterile filtered. 

Alternatively, viral harvests may be concentrated by ultrafiltration and the concentrated 
viral harvests may be subjected to an initial purification step, for example, by gel filtration 
chromatography, polyethylene glycol precipitation or Cellufine sulfate chromatography. The 
purified virus may then be detergent extracted to solubilize the S protein. Following 
solubilization of the S protein, the supernatant may be loaded onto an ion-exchange column such 
as Cellufine sulfate chromatography column equilibrated to pennit the protein to bind to the 
column while permitting contaminants to flow through. Similarly, a TMAE-fractogel or S- 
fractogel column may be used in place of the CeUufine sulfate column. The two columns also 
may be combined in sequential purification steps. The S protein is eluted from the columns to 
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provide a purified solution of the protein. This solution may be concentrated by tangential flow 
ultrafiltration using a membrane of desired molecular weight cut-off and diafilteced. 

Specifically, in one method of S protein purification, the virus harvest concentrate is 
centrifuged at 28,000 x g for 30 minutes at 4 C. The supernatant is discarded and the pellet 
5 resuspended in extraction buffer consisting of 10 mM Tris-HQ, pH 7.0, 150 mM NaQ, 2% 

(w/v) Triton X-100 to the original harvest concentrate volume. Pefabloc is added to a final 
concentration of 5 mM. The suspension is stirred at room temperature for 30 minutes. The 
supernatant, containing the soluble S protein, is clarified by centrifugation at 28,000 x g for 30 
min utes at 4 C. A TMAE— ftactogel column is equilibrated with 10 mM Tris-HCl, pH 7.0, 150 
to mM NaCl containing 0.02% Triton X-100. The Triton X-100 supernatant, containing the soluble 
S protein, is loaded directly onto the TRAE-Fractogel colunm. The total volume added plus 2 
bed volumes of 10 mM Tris-HCl, pH 7.0, 150 mM NaQ containing 0.02% Triton X-100 are 
collected. The TMAE— Fractogel flow-through containing S protein is diluted 3-fold with 10 
mM Tris-HQ, pH 7.0, containing 0.02% Triton X-100. 

15 An hydroxyapatite column is equilibrated with 10 mM Tris-HQ, pH 7.0, 50 mM NaQ, 

0.02% Triton X-100. After loading the TMAE flow-throu^, the column is washed with 2 
colunm volumes of 10 mM Tris-HQ, pH 7.0, 50 mM NaQ , 0.02% Triton X-100 followed by 4 
column volumes of 5 mM sodium phosphate, pH 7.0, IM NaQ, 0.02% Triton X-100. The 
proteins are eluted with 4 column volumes of 20 mM sodium phosphate, pH 7.0, IM NaQ , 

20 0.02% Triton X-100. Fractions are collected based on A280 and the protein content and antigen 

concentrations are measured. The purified S protein is ultrafiltered by tangential flow 
ultrafiltration using a 300 kDa NMWL membrane. 

2. Recombinant Production of Subunit SARS Vaccines 

As discussed above, SARS virus proteins inay be produced by recombinant expression. 

25 Host cells suitable for recombinant expression include bacterial, mammalian, insect, yeast, etc. 
Recombinant expression may be used to produce a full length SARS protein, a fragment thereof, 
or a fusion therewith. 

Fusion peptides may be used to facilitate the expression and purification of the 
recombinant SARS protein. For example, recombinant production of the SARS polypeptides 
30 can be facilitated by the addition a tag protein to the SARS antigen to be expressed as a fusion 
protein comprising the tag protein and the SARS antigen. Such tag proteins can facilitate 
purification, detection and stability of the expressed protein. Tag proteins suitable for use in the 
invention include a polyarginine tag (Arg-tag), polyhistidine tag (His-tag), FLAG-tag, Strep-tag, 
c-myc-tag, S-tag, calmodulin-binding peptide, cellulose-binding domain, SBP-tag„ chitin- 
35 binding domain, glutathione S-transferase-tag (GST), maltose-binding protein, transcription 

termination anti-terminiantion factor (NusA), E. coli thioredoxin (TrxA) and protein disulfide 
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isomerase I (DsbA). Prefraied tag protems include His-tag and GST. A fall discussion on the 
use of tag proteins can be found at Tope et aL, “Overview of tag protein fusions: from 
molecular and biochemical fundamentals to commercial systems”, Appl NEcrotriol Biotechnol 
(2003)^:523-533. 

5 After purification, the tag proteins may optionally be removed from the expressed fusion 

protein, i.e., by specifically tailored enzymatic treatments known in the ait Commonly used 
proteases include enterokinase, tobacco etch virus (TEV), thrombin, and factor Xa. 

Accordingly, the invention furthCT includes a SARS virus subunit vaccine comprising a 
fusion protein. Preferably, the fusion protein comprises a first amin o acid sequence encoded by 
10 a SARS virus polynucleotide sequence. SARS virus polynucleotide sequences which may 
encode said first amino add sequence include one or more of the SARS virus polynucleotide 
sequences identified in this application and fragments thereof. 

Tbe fusion protein may comprise an amino add sequence of a SARS virus protein or a 
fragment thereof. Said SARS virus protein may be selected from one or more of the group 
[5 consisting of the following SARS virus proteins: P28, P65, Nspl, Nsp2 (3CL protease), Nsp3, 
Nsp3, Nsp4, Nsp 5, Nsp6, Nsp 7, Nsp S, Nsp 9 (RNA polymerase), Nsp 10 (heficase), Nsp 11, 
Nsp 12, Nsp 13, Spike, Oif 3, Orf 4, Envelope, Matrix, Orf 7, Orf 8, Orf 9, Orf 10, Orf 1 1, 
Nucleocapsid and Qrf 13. 

In one embodiment, the fusion protein comprises a first amino acid sequence comprising a 
.0 SARS virus antigen or a fragment thereof. Said SARS virus amino add sequence may comprise 
one or more of the T-epitope sequences identified above. 

Preferably, the fusion protein comprises an amino acid sequence of a SARS virus spike 
protein, or a fragment thereof. Specific fragments of the spilm protein which ma y be used in the 
fusion protein include the SI domain and the S2 domain. Further fiagments of the spike piotdn 
5 which may be used in the fusion protein include regions of each of the SI and S2 Homaing 
including the receptor binding region of the SI domain, the oligo me rization domain regions of 
the S2 domain, the leudne zipper regions of the S2 domain, the membrane anchor region of the 
S2 domain, the hydrophobic domain region of the S2 domain, the cystein-rich domain region of 
the S2 domain, and the cytoplasmic tail region of the S2 domain. (See FIGURE 19). Amino 
0 acid sequmices of the Spike protein corresponding to these regions can be idaitified by those 

skilled in the art, including, for example, using the functional predictions set forth earlier in the 
^plication (predicted transmembrane helices, predicted N-terminus signaling regions, predicted 
coded-coil regions, etc.) as wefi as by homology comparison to the sequences of other known 
Coronaviruses (See FIGURES 4F and 5). 

5 The fusion protein may further comprise a second amino add sequence. Said second 

amino add sequence may comprise a polypeptide sequence which facilitates protein expression 
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or purificatioii, preferably one of the tag sequences discussed above. Alternatively, said second 
amino acid sequence may comprise a second amino acid sequence from a SARS virus. 
Alternatively, sard second amino acid sequence may comprises an amino acid sequence from 
another virus or bacteria, including one or more of the viruses or bacteria identified in Section I, 

5 below. 

Said second amino acid sequence may comprise an amino acid sequence fium another 
respiratory virus. Said second amino acid sequence may comprise an amino acid sequence from 
a virus selected from the group consisting of coronavirus, influenza virus, rhinovirus, 
parainfluenza virus (PIV), respiratory syncytial virus (RSV), adenovirus, and metapneumovirus. 
10 In one embodiment, said second amino add sequence may comprise an amino acid 

sequence from an adjuvant, including one or more of the adjuvants identified in section I, below. 

In one embodimmit, the invention includes a fusion protein comprising an amino acid 
sequence of a SARS vinis spike protein or a jfiagment thereof. The fusion protein may .further 
comprise a second amino acid sequence comprising an amino add sequence selected fiom the 
15 group consisting of a second SARS virus protein, a non-SARS virus protein, a bacterial protein, 
and an adjuvant 

(a) Bacterial Expression of Subunit SARS Vaccines 

In one embodiment, bacterial host cells are used for recombinant expression of SARS virus 
protdins. Bacterial host cells suitable for use in the invention include, for e xamp le, E. coli, 
iO Bctcillus subtilis, and Streptococcus spp. 

The SARS viral protein may be modified to facilitate bacterial recombinant expression. In * 
particular, the SARS spike protein may be modified to facilitate transport of the spike protein to 
the surface of the bacterial host cell. 

i^pplicants have discovered that tiiere is strong structural homology between the SARS 
15 virus spike protein and the NadA protein of Neisseria meningitidis. Both proteins have an N- 

terminal globular head domain (amino acids 24-87), an intermediate alpha-helix region with 
high propensity to form coiled-coil structures (amino adds 88-350), and a C-teiminal mpmhraTipt 
anchor domain formed by four amphipatix transmembrane beta strands (amino acids 351-405 of 
NadA). In addition, a leucine zippm: motive is present within the coiled-coil segment See, 

10 FIGURE 19 depicting the SARS spike protein structure Comanducci et al., “NadA, a Novel 
Vacdne Candidate of Neisseria meningitidis”, J. Exp. Med. 195 (11): 1445-1454 (2002). In 
addition, a leucine zipper motif of NadA is present within the coiled-coil segment. The NadA 
protdn also forms high molecular weight surface-exposed oligomers (corresponding to three or 
four monomers) anchored to meningococcal outer membrane. 

»5 When the NadA protein is expressed in E. coli, the full-length protein is assembled in 

oligonimn anchored to the outer membrane of E. coli, similar to the way the protein is presented 
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in meningococcus. The NadA protein devoid of the predicted membrane anchor domain is then 
secreted into the culture supernatant This secreted protein is soluble and still organized in 
trimers. 

The invention therefore includes a fusion protein comprising an amino acid sequence of a 
5 SARS virus spike protein or a fragment thereof and a second amino add sequence of a bacterial 
adhesion protein or a fragment thereof. Preferably, said adhesion protein is selected from the 
group consisting of NadA, YadA (of enteropathogenic Yersinia), and UspA2 (of Moraxella 
catarHudis). Additional NadA-like proteins include serum resistance protein DsrA of 
Haemophilus ducreyi, the immunoglobulin binding proteins EbA, C, D, and F of E. coli, outer 
.0 membrane protein 100 of ActinobacUlus actinomycetemcomitans, the saa gene carried on the 
large virulence plasmid present in shiga toxigenic strains of E. coli (STEC), and each of the 
bacterial adhesion proteins described in U.K Patent Application No. 0315022.4, filed on June 
26, 2003, each of which are specifically incorporated herein by reference. 

Preferably, said adhesion protein comprises NadA or a fingment thereof. 

IS Such fusion proteins may be used to facilitate recombinant expression of immunogenic 

portions of SARS surface antigens, such as spike. These fusion constructs may also allow the 
SARS SI and/or S2 domains to adapt to a native confirmation. These fusion proteins are also 
able to oligomerize and form dimers or trimers, allowing the S 1 and/or S2 domains to associate 
and adapt conformations as in the native SARS spike protein. Further, these expression 
10 constructs facilitate surface exposure of the SARS spike protein. 

Ihe fusion proteins of the invention preferably comprise a leader peptide from a NadA like 
protein, preferably NadA, a polypeptide from die immunogenic “head” region of the spike 
protein, and a stalk region from either the NadA like protein or the Spike protein. During 
expression and processing of the fusion protein, one or more amino acids may be cleaved off or 
15 removed, such as, t.e., the leader peptide or a membrane anchor domain. 

The stalk regions facilitate oligomerization of the expression protein. Optionally, the 
fusion proteins of the invention further include an anchor region of a NadA like protein. This 
anchor region allows the expression fusion protein to anchor and assemble on the bacterial ceU 
surface. 

30 The fusion proteins of the invention include the follovkdng constructs; 

(i) the NadA leader peptide (optionally also including the first 6 amino acids of the mature 
NadA protein to facilitate processing of the lead^ peptide and appropriate maturation of the 
protein) followed by the Spike SI domain. Preferably, this construct comprises amino acids 1- 
29 of NadA (corresponding to the NadA leader peptide and the first 6 amino acids of the mature 
35 NadA protein, as shown in FIGURE 22 and as set forth below) followed by amino adds 14-662 
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of a SARS virus Spike protein (corresponding to the SI domain, see FIGURE 19 and SEQ ID 
NO: 6042 and as set forth below). SpecificaUy, construct (i) comprises SEQ ID NO: 7302. 

(ii) the NadA leader peptide (optionally also including the first 6 amino adds of the mature 
NadA protein to facilitate processing of the leader peptide and appropriate maturation of the 

5 protein) followed by the Spike SI domain, followed by the stalk and anchor membrane domains 

of NadA. Preferably, this construct comprises amino adds 1-29 of NadA (corresponding to the 
NadA leader peptide and the first 6 amino adds of the mature NadA protein, as shown in 
FIGURE 22 and as set forth below) followed by amino acids 14-662 of a SARS virus Spike 
protdn (corresponding to die SI domain, see FIGURE 19 and SEQ ID NO: 6042 and as set forth 
10 below) followed by amino adds 88-405 of NadA (corresponding to the stalk and the anchor 
membrane domains). Specifically, constmct (ii) comprises SEQ ID NO: 7303. 

(iii) the NadA leader peptide (optionally also including the first 6 amino adds of the 
mature NadA protein) followed by a SARS virus Spike SI domain, followed by the NadA stalk 
domain. Preferably, this construct comprises amino adds 1-29 of NadA followed by amino 

15 adds 14-662 of a SARS virus Spike protein (corresponding to the SI domain), followed by 
amino adds 88-350 of NadA (corresponding to the stalk domain). Specifically, constmct (iii) 
comprises SEQ ID NO: 7304. 

(iv) the NadA leader peptide (optionally also including the first 6 amino acids of the 
mature NadA protein), followed by a SARS virus Spike SI and S2 domain (excluding the 

20 putative transmembrane region), followed by the anchor domain of NadA. Preferably, this 
constmct comprises amino acids 1-29 of NadA, foUowed by amino acids 14-1195 of a SARS 
virus Spike protein (corresponding to SI and S2, excluding the putative transmembrane region),, 
followed by amino adds 351-405 of NadA (corresponding to the NadA anchor domain). 
Specifically, constmct (iv) comprises SEQ ID NO: 7305. Altenatively, the NadA anchor 
25 domain may comprise amino adds 332 - 405 of NadA. 

(v) the NadA leader peptide (optionally also including the first 6 amin o acids of the mature 
NadA protein), followed by a SARS virus Spike SI and S2 domain (exclusing the putative 
transmembrane region). Preferably, this construct comprises amino adds 1-29 of NadA, 
followed by amino adds 14-1 195 of a SARS vims Spike protein. Spedfically, constmct (v) 

30 comprises SEQ ID NO: 7306. 

In each of constmcts (i) to (v), the first 23 amino acids are the NadA leader peptide, and 
the 6S dip^tide at residues 679-680 arises from the insertion of a restriction enzyme site. 

In constructs (i), (ii) and (iii), the NadA “head” is replaced by the Spike SI domain, and 
the fusion proteins are anchored to the outer membrane of E.coli or secreted in the culture 
35 supernatant, respectively. In constmcts (iv) and (v), the “head” and “stalk” domains of NadA are 
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replaced by S 1 and S2 Spike domains; also in diis case, the two fusion proteins are anchored to 
the out^ membrane of E.coli or secreted in die culture supernatant, respectively. 

Accordingly, the invention further includes a fusion protein com prisin g an amino add 
sequence of a SARS virus spike protein or a fragment thereof and a second amino add sequence 
5 of a bacterial adhesion protdn or a fragment tiiereof. Preferably, amino acids corresponding to 

the “head” of the adhesion protein are replaced by amino adds corresponding to a SARS virus 
Spike SI domain. Alternatively, die amino acids corresponding to the “head” and “stalk” 
domains of the bacterial adhesion protein are replaced by ammo adds corresponding to the 
SARS virus spike protdn SI and S2 domains. 

10 As discussed above and shown in Figure 19, the SI domain of the Spike protein is 

identified as the ^obular receptor binding “head” region. The SI domain of the Spike protein 
preferably comprises about amino adds 14-662 of SEQ ID NO: 6042. The SI domain may 
comprise a shorter amino acid sequence, wherein amino adds are removed from either the N- 
terminal or C-terminal regions. Preferably, 3, 5, 7, 9, 13, 15, 20 or 25 amin o acids are removed 
15 from either the N-terminal or C-terminal regions. The SI domdn further includes amino add 

sequences having sequence identity to the SI region of SEQ ID NO: 6042. An example of the 
S 1 domain is SEQ ID NO: 7307: 

As discussed above and shown in Rgure 19, the S2 domain of the Spike protein is 
identified as the “stalk” region. The “stalk” region comprises oligomerization domain regions, a 
20 . leucine zipper domain regions, membrane anchor regions, hydrophobic domain regions, cystein- 
rich domain region and a cytoplasmic tail region. The S2 dn-main of the Spike protein preferably 
excludes the transmembrane region and comprises about amino adds 663-1 195 of SEQ ID NO: 
6042. The S2 domain may comprise a shorter amino add sequence, wherein amino adds are 
removed from either the N-terminal or C-terminal regions. Preferably, 3, 5, 7, 9, 13, 15, 20 or 25 
25 amino adds are removed from either the N-terminal or C-terminal regions. The S2 domain 

further includes amino acid sequences having sequence identity to the S2 region of SEQ ID NO: 
6042. An example of the SI domain (with the transmembrane region excluded) is SEQ ID NO: 
7308. 

An example of the NadA protein described above is SEQ ID NO: 7309. As discussed 
30 above, the leader sequence of NadA used in the fusion protein preferably comprises about the 

first 29 amino acids of NadA (including a leader sequence with about 6 amino acids of the NadA 
head protein). Examples of such a leader sequences are set forth as SEQ ID NOS: 7310 and 
7311 below. The fusion protein may use a leader sequraice comprising a shorts amino acid 
sequence, wherein amino adds are removed from either the N-tominal or C-tominal regions. 

35 Preferably, 1, 2, 3, 4, or 5 amino adds are removed from either the N-terminal or C-te rminal end 
of the sequence. Ibe leader sequence used in the fusion protein may also include an amino add 
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sequences having sequence identity to SEQ ID NO: 7310 or SEQ ID NO: 7311. Preferably, the 
leader sequence comprises SEQ ID NO: 7311. 

Optionally, the fusion peptide comprises about the iBrst 6 amino acids of the mature NadA 
protein to facilitate processing of the leader peptide and appropriate maturation of the protein. 

5 An examples of the first 6 amino acids of a mature NadA proteins is SEQ ID NO: 7312.. 

As discussed above, the stalk and anchor sequences of NadA used in the fusion protein 
preferably comprise about amino acids 88-405 of NadA. An example of an amino acid sequence 
comprising NadA stalk and anchor regions is set forth below as SEQ ID NO: 7313 below. An 
example of an amino add sequence comprising a NadA stalk region (without the anchor region) 
10 is set forth as SEQ ID NO: 7314 below. An example of an amino acid sequence comprising a 
NadA anchor region is set forth as SEQ ID NO: 7315 below. The fusion protein may use a stalk 
(and/or anchor) sequence comprising a shorter amino acid sequence, wherein amino adds are 
removed from eidier the N-terminal or C-tenninal re^ons. Preferably, 1, 2, 3, 4, 5, 6, 7j 8 or 9 
amino adds are removed from either the N-terminal or C-terminal end of the sequence. The 
15 leader sequence used in the fusion protein may also include an amino add sequences having 

sequence identity to the SEQ ID NO: 7313. 

The fusion protdns of the invention, including diose described above, may be prepared, for 
example, as follows. Single fragments (such as the regions described above) may be amp lifi ed 
by PCR using the oligonucleotide primers set forth in the Table below. (S 1 l refers to the Spike 
20 protein fused to the leader peptide of NadA; S2 refers to the stalk region of the Spike protein, 
with and without the stop codon). The oligonucleotides were designed on the basis of the DNA 
sequence of NadA fium N. meningitidis B 2996 strain and of Spike from SAKS virus isolate 
FRAl. Each oligonucleotide includes a restriction site as a tail in ord^ to direct the cloning into 
the expression vector pET21b. 







SEQ ID NO: 


Restriction site 


SIl 


For 


7316 


Ndel 


SIl 


Rev 


7317 


Bamin 


S2 


For 


7318 


BamHI 


S2 


Rev 


7319 


Hindin 


S2-stop 


Rev 


7320 


Xhol 


NadAgg 


Pbr 


7321 


BamHI 


NadAsso 


Rev 


7322 


Xhol 


NadAg32 


For 


7323 


Hindin 


NadA405 


Rev 


7324 


Xhol 



The single fragments are sequentially cloned into pET21b vector, in order to express the 
proteins under the control of inducible T7 promoter. The SI domain of the Spike protein fused 
to the leador peptide of NadA (SIl) was obtained by PCR using the primers Slt-For and SIl- 
Rev. The forward oligonucleotide primer contains the Ndel restriction sequence and the 
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sequence coding for the leader peptide of NadA plus the first 6 aminoacids of the mature protein. 
The PCR fragment was cloned as a Ndel/BamHl fiagment in die pET21b vector opened with the 
same restriction enzymes. This clone (pET-S 1 l) was then used to sequentially clone the oth^ 
diffoent domains, as BamHI/XhoI, BamHI/ffindni or HindlllZXhoI fragments. BamHI and 
5 Huidm restriction sites introduce the aminoacids GS and KT -. respectively. 

The PCR amplification protocol was as follows: 200ng of genomic DNA from Neisseria 
meningitidis 2996 or 10 ng of plasmid DNA preparation (plasmid pCMVnew, containing the 
entire gene coding of the Spike protein), were used as template in the presence of 40nM of each 
oligonucletide primer, 400-800 pM dNTPs solution, lx PCR buffer (including l.SmM MgCk), 

10 2.5 units TaqI DNA polymerase (using Perkin-EImer AmpUTaQ or Invitro^n Platinum Pfe 

DNA polymerase). 

After a preliminary 3 minute incubation of the whole mix at 95°C, each sample underwent 
a two-step amplification: the first 5 cycles were performed using the hybridisation temperature 
that excluded the restriction enzyme tail of the primer (Tml). This was foUowed by 30 cycles 
15 according to the hybridisation temperature calculated for the whole length oligos (Tm7) 

Elongation times, performed at 68°C or 72“C, varied according to the length of the fragment to 
be amplified.The cycles were completed with a 10 minute extension step at 68“C or 72°C. 

The amplified DNA was either loaded directly on agarose gel and the DNA fiagment 
corresponding to the band of correct size was purified from the gel using the Qiagen™ Gel 
to Extraction Kit, following the manufacturer’s protocol. 

The purified DNA corresponding to the amplified fiagment and the plasmid vectors' were 
digested with the appropriate restriction enzymes, purified using the QIAquick™ PCR 
purification kit (following the manufacturer’s instructions) and ligation reactions were . 
performed. 

5 The ligation products were transformed into competent K coli DH5a and screening for 

recombinant clones was performed by growing randomly-selected colonies and extracting the 
plasmid DNA using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer’s 
instructions. 

Recombinant plasmids were introduced into E. coli BL21(DE3) used as expression host 
0 Single recombinant colonies were inoculated into LB + an^cillin and incubated at 37°C for 14- 
16 h. Bacteria were directly recovered by centrifugation (uninduced conditions) or diluted in 
fiesh medium and grown at 37“C until ODeoo between 0.4-0.8. Protein expression was induced 

by addition of 1 mM Isopropyl-l-thio-P-D-galactopyranoside (IPTG) for three hours (induced 
conditions). 

5 Whole cell lysates were obtained resuspending bacteria in SDS-sample buffer IX and 

boiling for 5-10 min. Equal amounts of proteins were separated using NuPAGE (Invitrogen) or 
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^ BIORAD Gel System, according to the manufacturer’s instructions. Proteins were revealed by 
Coomassie-blue staining or transferred onto nitrocellulose membranes for western blot analysis. 
Western blot was performed using a rabbit polyclonal anti-serum against purified NadA^asi-ws 
(diluted 1:3000) and a secondary peroxidase-conjugate antibody (DAKO). 

5 Results of the expression in Exoli of S 1 l, SlL-NadA and S iL^NadA^anchor are shown in 

FIGURES 38 and 39. Schematics of the fusion constructs are shown in FIGURE 37. 

Bacterial expression of the S ARS viral antigens may also be used to prepare compositions 
comprising outer membrane vesicles wherein said outer membrane vesicles comprise one or 
more SARS viral antigens. 

10 Outer Membrane Vesicles (“OMV’), also referred to as blebs, refe to vesicles formed or 

derived from fragments of the outer membrane of a Gram negative bacterium. OMVs typically 
comprise outer membrane proteins (OMPs), lipids, phospholipids, periplasmic material and 
lipopolysacchaiide (IPS). Gram negative bacteria often shed OMVs during virulent infections 

in a process Imown as blebbing. QA4Vs can also be obtained from Gram negative bacteria via a 

15 number of chemical denaturation processes, such as detergent extraction. Synthetic OMVs or 

liposomes, comprising a lipid bilayer and typically enclosing an aqueous core, can also be 
prepared with the SARS viral antigens of the invention. 

Ibe OMVs of the invention are preferably lipid vesicles comprising a lipid bilayer 
surrounding an aquous core. Typically the lipid vesicles are of unilamellar stracture (t.e. , a 
10 single lipid bfiayer surrounds the aquous core), although multilammelar lipid vesicles may also 
be used in the compositions of the invention. OMVs typically have sizes in the nanomolar to 
micromolar range, e.g., from 1 nM to 100 pM, more typically from lOnM to 10 pM and 
preferably from 30 nM to 1 pM. 

The OMVs of the invention are preferably prepared from gram negative bacteria. Gram 
15 negative bacteria are diose bacteria that fail to resist decolori 2 :ation in the commonly known 

Gram staining method. Gram negative bacteria are characterized by a complex multilater cell 
wall and often possess an outer layer polysaccharide capsule. Gram negative bacteria suitable 
for producing OMVs include, for example, species from Neisseria, Moraxella, Kingella, 
Acinetobacter, Brucella, BordeteUa, Chlamydia, Porphyromonas, Actinobacillus, Borelia, 
iO Serratia, Campylobacter, Helicobacter, Haemophilus, Escherichia, Legionella, Salmonella, 
Pseudomonas and Yersinia. 

The OMVs of the invention preferably comprise one or more SARS viral antigens or a 
fragment thereof. The SARS viral antigens may be lecombinantly expressed in a Gram negative 
bacterial host cell and then.harvested witfi the OMV. 

5 Antigenic components, such as recombinantly expressed SARS viral antigens, ma y be 

located in any or all of the three main compartments of the lipid vesicles, including attached to 
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either tiie interior or extenor surface of the lipid vesicle, for example via a memh i-anp, anchor 
domain, or attachment to a lipid moiety; inserted into the lipid bUayer, for example where the 
antigenic component is itself a hydrophobic or lipid based aitity; or located within the aqueous 
center or core of the lipid vesicle. 

5 Synthetically prepared OMVs, or liposomes, may be used in the invention. Such 
liposomes may comprise a number of different lipids and fatty acids. Suitable lipids for 
inclusion in liposomes of the invention include but are not limited to phophatidylinositol-(4,5)- 
diphosphate, phosphatidylserine, phosphatidylcholine, phosphaddylethanolamine, 
phosphatidyglycerol, cholesterol, beta-oleolyl-gamma-palmitoyl, lipopolysaccharides and 
10 galactocerbrosides. 

Suitable means for extraction of OMVs from bacterial sources include deoxycholate 
extraction, Tris/HCl/EDTA extraction, and lithium acetate extraction. Preferably, the extraction 
process comprises a physical and/or chemical means to disrapt the bacterial cell outer membrane 
in order to release sufficient OMVs for purification and isolation. See, e.g., WO 03/051379. 

15 The OMVs of the invention may be enriched and/or supplemented with antigenic 

cornponents, such as SARS viral antigens, by methods known in the art, including, for example, 
direct combination in vitro where an ener^tic combination step can optionally be applied to 
facihtate integration of the antigenic component into a compartment of the liposome. Methods 
of energetic combination suitable for use in the invention include homogenization, 

10 ultr^onication, extmsion, and combinations thereof. 

Preferably, the antigenic component, such as the SARS viral antigen, is recombinantly 
produced by the host cell from which the OMV is derived. In one embodiment, such OMVs are 
prepared by introducing nucleic acid sequence encoding for the SARS viral antigen into the 
recombinant host cell. Preferably the nucleic acid sequence encoding for the SARS viral antigen 
55 is coritrolled by a strong promoter sequence. Preferably, the nucleic acid sequence encoding the 
SARS viral antigen further comprises an outer-membrane targeting signal. For example, the 
nucleic acid sequence encoding the SARS viral antigen may be fused to a sequence encoding for 
a naturally occurring outer membrane protein of the bacterial host Preferably, the nucleic acid 
sequence encoding the SARS viral antigen is fused to the signal peptide sequence of the 
■0 naturally occurring outer membrane protein of the bacterial host 

Methods of preparing an optimizing OMVs for use in vaccines are disclosed in, for 
example Filip et oL, J. Boot. (1973) 115; 717-722; Davies etal.,J. Immunol. Method (1990) 
143:215-225; and WO 01/09350. 

In one embodiment, a bactmal host cell, such as E. coli, are transformed to express the 
5 SARS spike protein. As discussed above, the spike protein may be modified to facilitate 

bacterial expression and transport of the spike protein to the surface of the host ceU. Each of the 
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Spike/NadA fusion constructs discussed above may be used in the OMV pi^>aiations of the 
invention. Preferably, constructs comprising the spike SI globular head domain fused to the 
stalk region of NadA are used to generate OMVs. The construct may optio nall y include the 
NadA leader peptide as weU as the NadA anchor peptide. Schematic diagrams of these preferred 
5 OMV constructs are depicted in FIGURE 49. 

Example 6 describes one method of preparing the OMVs of the invention. 

(h) Mammalian Ejqtression of Subunit SARS Vaccine 

As discussed above, mammaUan host cells may be used for recombinant expression of 
SARS virus proteins. Mammalian host cells suitable for use in the invention include, for 
.0 example, Chinese hamster ovary (CHO) cells, HeLa ceUs, baby hamster kidney (BHKO ceUs, 
monkey kidney cells (e.g. , Hep G2), Madin-Darby bovine kidney (“MDBK”) cells, as well as 
others. MammaUan sources of ceUs include, but are not Umited to, human or non-human primate 
(e.g., MRC-5 (ATCC CCL-171), WI-38 (ATGC CCL-75), human embryonic kidney cells (293 
cells, typically transformed by sheared adenovirus type 5 DNA), VERO cells fiom monkey 
5 kidneys (including, for example COS7 cells), horse, cow (e.g.. MDBK cells), sheep, dog (e.g.. 
MDCK ceUs fiom dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, deposit 
nuniber DSM ACC 2219 as described in WO 97/37001), cat, and rodent (e.g., hamster cells such 
as BHK21-F, HKCC ceUs, or Chinese hamster ovary cells (CHO cells)), and may be obtained 

fiom a wide variety of developmental stages, including for example, adult, neonatal, fetal, and 
3 embryo. 

. The polynucleotides encoding the SARS viral proteins may be modified to faciUtate or 
enhance expression. For example, commercial leader sequences known in the art, such as tPA or 
IgK or interleukin-2, may be used in the recombinant constructs. Preferably, however, the 
natural SARS leader sequence is used. Use of the natural leader sequence can be used to ensure 
) that the protein will be trafficked in human ceUs in the same way as during a normal viral 

infection, which may be advantageous e.g. for DNA vaccines, where antigen is expressed in situ. 

As discussed above, tag sequences can be used in the expression constructs to fadlitate 
purification, detection and stability of the expressed protein. Tag proteins suitable for use in the 
invention include a polyarginine tag (Arg-tag), polyhistidine tag (His-tag), FLAG-tag, Strep-tag, 

' c-myc-tag, S-tag, calmodulin-binding peptide, cellulose-binding domain, SBP-tag„ chitin- 

binding domain, glutathione S-transferase-tag (GST), maltose-binding protein, transcription 
termination anti-tenniniantion factor (NusA), E. coli thioredoxin (TrxA) and protein disulfide 
isomeraseI(DsbA). Preferred tag proteins include His-tag and GST. A full discussion on the 
use of tag proteins can be found at Terpe et al, “Overview of tag protein fusions: fiom 

molecular and biochemical fundamentals to commercial systems”, Appl Microbiol Biotechnol 
(2003) 60:523-533. 
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After purification, the tag proteins may optionally be removed fix>m the expressed fusion 
protein, Le., by speciJBcaUy tailored enzymatic treatments known in the art Commonly used 
proteases include enterokinase, tobacco etch virus (TEV), thrombin, and factor Xa. 

One or more amino acid sequences or amino acid domains of the spike protein may be 
5 removed to facilitate mammaUan recombinant expression. For instance, the entire S2 domain or 

the spilm transmembrane region may be removed. Representative examples of some expression 
constructs of both full length and truncated spike glycoprotein suitable for mammalian 
expression ate shown in FIGURE 40. Polynucleotide sequences representing each construct are 
shown in SEQ ID NOS 6578-6583. A description of each annotation is shown below: 



Qone Name 


Description 


Exnression Construct 


nSh 


natural leader sequence 
full length Spike 
histidine tag 


SEQ ID NO: 6578 


nS 


natural leader sequence 
full length Spike 


SEQ ID NO: 6579 


nShATC 


natural leader sequence 

Spike without transmembrane sequence 

histidine tag 


SEQ ID NO: 6580 


nSATC 


natural leader sequence 

Spike without transmembrane sequence 


SEQ ID NO: 6581 


nSlh 


natural leader sequence 
SI domain 
histidine tag 


SEQ ID NO: 6582 


nSl 


natural leader sequence 
SI domain 


SEQ ID NO: 6583 



Cloned cDNA fragments that encompass full-length Spike coding sequences, as well as a 
Spike construct deleted of the transmembrane and cytoplasmic domains (TM-Cy-deleted Spike) 
for secretion were inserted into an expression vector pCMVin to create nSh and nShATC, 
respectively. Both spike proteins were tagged with six histidine residues at the end of C- 
.5 t^minus to aid initial characterization of the expressed spike proteins. Similar sequences 

^coding full-length Spike or transmembrane and cytoplasmic domain deleted Spike, but without 
the histidine “tag” are readily substituted by one of skill in the art. 

The likely locations of the expressed spike constructs was assessed by separating expressed 
proteins into an aqueous fraction (AF) and a detergent fraction (DF) using the procedure shown 
® in Rgure 48, with results of western blot analysis shown in Rgure 43. The above described 

vector constructs were evaluated for expression after transfection into COS7 cells. The construct 
expressing the full length spike protein remained in the cell membrane while the construct 
expressing the truncated spike protein was located either in the cytosol (Figure 43) or secreted 
into the cell medimn (Rgure 44). As shown in Rgure 43, full-length spike protein is found in DF 
5 (membrane) in an aggregated form, while the truncated protein is found in AF (cytosol) as a 
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monomer. As shovm in Figure 44, deleted proteins (ShATC) are secreted, and a gmall fraction of 
full-length spike protein is detected in the medium by rabbit seruiiL 

Recombinantly expressed spike proteins may be oligomerized. When the spike proteins 
are to be used in a vaccine or to gMierate antibodies specific to the spike protein, they are 
5 preferably oligomerized. In order to obtain oligomoized spike protein, it is preferred to 
maintain the transmembrane dom a in in the recombinant expression construct. For example, 
FIGURE 41 illustrates a western blot of COS7 cell lysates comparing expressed nSh and 
nShATC using both anti-his tag and rabbit anti-SARS antibodies. As shown fiill-length (nSh) 
aggregates, but the truncated (nShATC) spike protein does noL Antibody raised against the His- 
10 tagged protein recognizes full-length and truncated spike proteins in native and reduced forms. 
Rabbit antiserum recognizes spike protein only in non-reducing conditions. Spike aggregates or 
oHgomers were present in larger amounts in the ceU lysates from the expressed nSh constructs. 
Preferably, the oligomerized spike proteins form a homotrimer, as indicated in FIGURE 47 
A further experiment, illostrated in FIGURE 42, demonstrates that the oligomerization of 
15 the expressed nSh constructs is likely due to a non-covalent linkage (and is likely not due to, for 
example, a disulfide bond). The oligomer dissociates into monomers at elevated temperature 
(80-100“C), but is stable in reducing conditions if not heated. 

It is further preferred that recombinantly expressed spike proteins are glycoslyated. 
Tunicamycin and glycosidases were used to assess glycosylation. FIGURE 45 illustrates that 
.0 glycoslation of expressed spike proteins is not affected by removal of the transmembrane domain 
region. Both full-length (Sh) and truncated (ShATC) SARS spike proteins are glycosylated. 

PrefMably, expression of the constructs of the invention is not toxic to the mammalian host 
cell. FIGURE 46 demonstrates that expression of the iDustrated spike constructs is not toxic to 
the COS7 host cell. 

■5 Methods for transfecting, expressing, culturing, isolating and purifjdng recombinant 

proteins from mammalian ceU cultures are known in the art For exan^le, the SARS spike 
constructs of the invention may be expressed in 293 cells. These cells may be cultured and 
transfected in static or monolayer cultures. For rapid large-scale production of SARS protein 
antigens in sufficient quantities for in vitro and in vivo evaluation, including immunogenidty 
0 studies, large-scale transient transfection of 293 (human embryonic kidney) cells may be used to 
obtain milligram quantities of the recombinant antigen(s). Alternatively, larger scale transfection 
of these cells may be performed with 293 cells in suspension culture. Preferably, the expressed 
SARS proteins are harvested from the transfected cells between 48 and 72 hours after 
transfection or even from 72 to 96 or more hours after transfection. 

5 Where the host cells are transfected with truncated spike expression constructs, the 

expressed spike protein is secreted firom the host ceUs and coUected from the ceU media. After 
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concentration, the spike protein may be purified fiom the media using, for example, GNA lectin 
followed by DEAE and ceramic hydroxyapatite column chromatogrJ 5 )hy. 

Where the host cells are transfected with fuU length spike expression constructs, but rather 
is retained within the cells, and may be purified fiom triton X-100 detergent extracted cells. The 

5 fuU-length Spike protein can then be captured on GNA lectin, followed by hydroxyapatite and 
SP chromatography. 

Chinese Hamster Ovary (CHO) or other eukaryotic (e.g., mammaUan) ceUs that stably 
express the SARS viral antigens of the invention may also be derived (e.g. Rgute 73). 

Preferably, the cells are CHO cells, and tiiese constructs will conoprise one or more marker or 
.0 selection genes in order to select for the desired CHO cells, fii one embodiment, the constructs 
comprise a CMV enhancer/piomoter, ampicillin resistance gene, and a fused DHFR and 
attenuated neomycin gene for selection purposes. Stable ceU lines can then be produced using 
the neomycin selection system in CHOK-1 cells. Selected clones can then be sequenced to 
verify the mtegrity of the insert, and transient transfections can then be performed using Trans- 
5 LTl polyamine transfection reagent (PanVera Corp., Madison, WI) to assess the expression level 
and also the integrity of the expressed protein by EUSA and western blot analysis. 

Methods for derivation of CHO cells stably expressing the SARS viral antigens of the 
invention comprise the steps of transfection and primary screening with selective medium. 
Optionally, these steps are followed by subcloning to assure purity of cell lines. Cell culture 
0 supernatants can be assayed using an antigen capture EUSA to quantify expression levels at aU 
stages of selection and amplification. 

. For full-length Spike expression constructs, methanol fixed cells can be screened for 
internal expression by immunofluorescent staining using a rabbit anti-SARS antibody. 

Successive measurements at the T75-flask stage of expansion can be employed to assure stability 
> of expression levels. The molecular mass and integrity of the expressed proteins can be checked 

by PAGE both under native and reducing and denaturing conditions, followed by 
immunoprobing. 

fir one embodiment, the pCMV3 vectors expressing SARS-CoV Spike proteins in either 
full-length or truncated forms is introduced into CHOK-1 cells using the Trans-LT-1 reagent 
) On day one, 1x10** cells are plated on 100 mm dishes in non-selective F12 media + 10% Ffetal 
Bovine Serum + 4 mM Glutamine. On day two, the ceils are transfected with a DNA:LT-1 
mixture and the media then replaced with complete F12 media. Twenty-four to forty-eight hours 
later depending on the ceU density, each 100 mm dish is split to 4-6 100 mm dishes. Hie 
medium is changed to complete selective media containing Geneticin (neomycin) at 500 fig/ml. 

AU bovine serum used in these procedures is fiom TSE-fitee sources that meet current FDA 
standards. Twenty-four hours later the medium is changed to complete selective medium plus 
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500 ug^ neomycin. Ten to fourteen days later, individual colonies are picked and transferred 
to 96 weD plates and cultured in complete selective medium but without G418. When 
approximately 80% of tiie wells are confluent, twenty-four hour supernatants are screened by 
spike capture ELISA positive clones are transferred to twenty-four well plates. For the initial 
5 expression of full length Spike protein, methanol fixed cells wiU be screened by 

immunoflourescent staining using a rabbit anti-S ARS antibody. After the low expressing cell 
lines have been eliminated and there are less than 20-30 cell lines, c^ture ELISA and westerns 
will be used to determine the expression level after cell lysis. A portion of each cell line will be 
pelleted, weighed and lysed in 1% triton lysis buffer containing MOPS, NaQ and MgQ 2 at the 
10 same ratio of ceU weight to lysis buffer. After lysis the supernatant is coUected and expression 
level is determined. Three to four clones producing the highest levels of spike protein in correct 
structure and conformation will be grown in three-liter bioreactors for expansion and adaptation 
to low serum suspension culture conditions for scale-up.- 

capture ELISA assay for the SARS spike protein can be performed as 
15 described in the art A brief description of this assay follows. 96 well flat-bottom plates • 
(Coming, Coming, NY) are coated with 250ng per well of purified immunoglobulin obtained 
fiom rabbit sera that were immunized with inactivated SARS virus. Between steps, the plates 
are washed in a buffer containing 16%NaCl and 1% Triton XIOO. lOOpL of supernatant or 
lysate samples (diluted in a buffer containing lOOmM NaP 04 , 0 . 1 % Casein, ImM EDTA, 1% 
iO Triton XIOO, 0.5M NaCl and 0.01% Thromersal, pH 7.5) are added and incubated for 2- hours at 
37“C. Bound antigen is reacted against pooled SARS+ve serum or high affinity monoclonal 
antibody either human or mouse against SARS spike protein (1 hour incubation, 37°C) and 
detected using appropriate species-spedfic peroxidase conjugated second antibody (30 minute 
incubation at 37°C; TAGfO, Burlingame, CA). The plates are developed for 15 minntp-g at room 
-5 temperature using TMB substrate (Pierxe, Rockford, IL) and the reaction stopped using 4N 
phosphoric acid. The plates are read at a wavelength of 450nm and the concentration of protein 
per ml sample is derived fiem a standard curve (OD vs. protein concentration) based on serial 
dilutions of a known concentration of recombinant spike protein. 

The immunoprobing analysis can also be performed following the standard methods 
0 described elsewhere in the art A brief description follows. 10-20 pi ofthe sample is analyzed 
on 4-20% SDS PAGE under non-reducing/ denaturing conditions with mild heating. The gels 
are run for 1.5-2.0 hours at lOOV constant voltage. The proteins are then transferred onto 
nitroceUulose membranes (Millipore, Bedford, MA) for 45 min using the semidry western 
transfer system (BioRad, Hercules, CA) following the manufacturer’s instructions. The 
5 membrane is then reacted against polyclonal anti-spike rabbit serum, followed by anti-rabbit Ig 
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conjugated to Alexa 688 (Molecular Probes, Oregon). The blots are scanned using an infiared 
im a gin g system (Ll-Cor, Ihc., Lincoln, Nebraska). 

The highest expressing candidate ceU lines can be screened for spike protein expression 
and stability in smaU-scale (3 liter) suspension cultures. The candidate clone can be further 
5 evaluated for level of expression as well as integrity of expressed protein after amplification, and 

subsequently tested for expression stability in the absence of selection. The selected clones can 
also be tested for maintenance of the DNA sequence integrity of the integrated SARS spike 
protem gene. To quickly monitor the expression levels in smafi flask (T25 or T75) and in the 
three Hter evaluation cultures, a lectin-based process (Gluvanthus Nivalis lectin) may be used to 
.0 isolate SARS spike protein to a degree of purity that allows semi-quantitation and 

characterization of the protein in CHO supernatant. For fuU-lengih spike protein, it will be 
obtained from triton X-100 detergent extracted ceUs. FuU-length Spike protein wiU be then 
captured on GNA lectin, foUowed by hydroxyapatite and SP chromatograph. Huted protein is 
then characterized by: 1) polyacrylamide gel electrophoresis (PAGE) and Coomassie staining 2) 
5 Immunoprobing with anti-SARS rabbit sera. 3) structural characterization using size exclusion 
chromatography (SEC), as well as mass spec analysis using M AT .n i-TOF. 

Routes and methods of immunization of the vaccines of the invention.are discussed in 

more detail in a section below. Examples 7 to 9 illustrate sample immunization protocols for the 
recombinant spike proteins. 

3 Vaccine testing 

Prior to human admimstration, it is normal to test vaccines iii animal models. A mouse 
model of SARS coronavirus infection is known (Subbarao et al. (2004) J Virol 78:3572-77), and 
other animals that may be used as models of infection and/or disease include ferrets and 
monkeys. Thus the invention provides a non-human animal that is infected by the SARS 
> coronavirus. wherein the animal is preferably a ferret or a primate {e.g. a monkey or a macaque). 

The animal may be gnotobiotic. The animal is preferably not a cat {Felis domesticus). The 
animal may or may not display SARS disease symptoms e.g. ferrets (Mustela faro) show 
prominent pulmonary pathology after infection. See: Martina et al. (2003) Nature 425:915. 

K Polynucleotide s encod i ng the SARS Antieewi nfth^ 

The invention includes polynucleotides encoding for the SARS antigens of the invention. 

In addition, the invention includes polynucleotides which have been optimized for recombinant 
production {e.g. codon optimization) of the SARS antigens of the invention, including 
polynucleotides encoding for each of the SARS fusion constracts discussed above. 
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F^ Viral vector or Viral Particle delivery of the SARS Aroieens of the Tnvtmtinn 

The antigens of the invention may be expressed in vivo or in vitro by polynucleotides 
encoding the antigens. Expression and delivery of the polynucleotides of the invention may be 
facilitated via viral vectors and/or viral particles. 

5 Gene-based delivery systems derived from viruses, such as alphaviruses, are useful for the 

ex vivo and in vivo admiiustration of heterologous genes, including one or more SARS genes, 
having therapeutic or prophylactic applications. These systems can also be used for the 
production of recombinant proteins derived from the SARS virus in cultured cells. Gene-based 
deUvery systems of the invention include viral vectors (e.g., adenovirus vector, poxvirus vector, 
10 alphavirus vector) and non-viral nucleic acid vectors (g.g., DNA, RNA) encoding one or more 
SARS virus antigens. Polynucleotides encoding SARS virus antigen(s) are incorporated into the 

gene-based vaccines individually or in combination (e.g., as bicistronic constructs). 

1. Alnhavims 

Alphaviruses are members of Togaviridae family and share common structural and 

15 replicative properties. Sindbis virus (SIN) is the prototype virus for the molecular study of other 

alphaviruses, and togetho: with Venezuelan equine encephalitis virus (VEE) and Semliki Forest 
virus (SFV), are the most widely utilized alphaviruses being developed into expression vectors 
for heterologous genes (Schlesinger and Dubensky (1999) Curr Opin. Biotechnol. 70:434439; 
Schlesinger (2001) Expert Opin. Biol. Ther. 1 : 177-91). 

•0 ^Phaviruses possess a relatively small single-stranded RNA genome of positive polarity, . 

which is approximately 12 kb in length, capped and polyadenylated. The RNA interacts with 
viral capsrd protern monomers to form nucleocapsids, which in turn, are surrounded by'a host 
cell-denved lipid envelope finm which two viral glycoproteins, El and E2, protrude forming 
“spike” trimers of heterodimeric subunits. Two open reading frames (ORFs) encode as 
5 polyproteins the enzymatic nonstructural repUcase proteins (5’ ORF) and the virion stractural 
proteins (3 ORF). Hie structural polyprotein is translated from a highly abundant subgenomic 
mRNA, which is transcribed from a strong internal alphavirus promoter (Strauss and Strauss 

(1994) Microbiol. Rev. 58:491-562). Replication of the genome occurs exclusively within the 
host cell cytoplasm as RNA. 

3 The most common alphavirus expression vectors have exploited both the positive-stranded 

nature and modular organization of the RNA genome. These vectors, termed “leplicons” due to 
their property of self-amplification, permit insertion of heterologous sequences in place of the 
structural polyprotein genes, while maintaining the 5’- and 3’-end cis rephcation signals, the 
nonstructural replicase genes, and the subgenomic junction region promoter (Xiong et al. (1989) 

) Science 245:1188-1191; liljestrom (1991) Bio/Technology 9:1356-1361). Chimeric alphavirus 

vectors (and particles) from sequences derived from divwgent virus families have also been 
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described, (see, for exan^)le IMted States patent application serial number 09/236,140; see also, 
US Patents 5,789,245, 5,842,723, 5,789,245, 5,842,723, and 6,015,694; as well as WO 95/07994, 
WO 97/38087 and WO 99/18226). Co-owned International Publication WO 02/099035, 
published December 12, 2002 and incorporated by reference in its entirety herein, describes 

5 chimeric alphavirus molecules and modified alphavirus molecules having modified Biosafety 
Levels. 

The absence of structural protein genes renders alphavirus replicon vectors defective, in 
that RNA amplification and high-level heterologous gene expression occurs within the target 
ceU, but cell-tooell spread of vector is not possible due to the inability to form progeny virions. 

.0 Through the years, several synonymous terms have emerged that are used to describe alphavirus 
replicon particles. Drese terms include recombinant viral particle, recombinant alphavirus 

particle, alphavirus repficon particle and replicon particle. However, as used herein, these terms 

all refer to a virion-like unit containing an alphavirus-derived RNA vector replicon. Moreover, 
these tenns may be referred to coDectively as vectors, vector constructs or gene delivery vectors. 

5 Packaging of replicon RNA into particles can be accomplished by introducing the repUcon 

RNA into permissive cells (c.g.. RNA or DNA transfection, or particle infection) that also 
cont^ one or more structural protein expression cassettes or “defective helper^’ constructs 
encoding the alphavirus structural proteins. These structural protein encoding constructs may 
themselves be introduced into the cells by transfection of either RNA or DNA, and most 

3 retain the native alphavirus subgenomic promoter, as well as 5’- md 3 ’-endow 

signals for co-amplification with the repUcon, but are devoid of any replicase genes and the RNA 
packaging signal (Uljestrom {1991) Bio/Technology 9:1356-1361; Pushko or uZ. (1997) Virology 
239:389-401;Poloora/.(1999)PNAS96:4598-4603). Permanent ceU lines that are stable 

transformed with constructs expressing the alphavirus structural proteins {e.g, packaging cell 

> lines) offer a means to avoid transient transfection production methods (Polo et oL (1999) PNAS 
96:4598-4603). 

The present invention includes compositions and methods for the production of replication 
defective viral vector particles (e.g., alphavirus replicon particles) for use in the ex vivo and in 
vivo administration of heterologous genes encoding proteins having therapeutic or prophylactic 

' application, including genes encoding for one or more SARS viral antigens. 

In one aspect, the invention includes a method of producing replication defective viral 
vector particles {e.g., alphavirus replicon particles) comprising the steps of introducing at least 
one nucleic acid molecule comprising a viral vector {e.g., alphavirus replicon RNA) into 
immortalized cells of the present invention, under conditions that allow for complementation of 
the viral vector {e.g., alphavirus repUcon RNA) and production of viral vector particles {e.g., 
alphavirus repUcon particles), and isolating the viral vector particles from the ceUs or ceU culture 
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supernatants. In certain ranbodiments, the inomoitalized ceDs are grown in suspension, for 
exan^jle PERC.6 cells. In other embodinients, the methods are performed in large-scale 
volumes, for example, liter volumes or greater, such as for example in roller bottles, large flasks. 
Nunc Cell Factories, Coming Cell Cubes, fermentation vessels, etc). 

5 In certain embodiments, the viral vector is an alphavirus replicon RNA that requires 

complementation by providing one or more alphavirus stractural proteins in trans, within the 
immortalized cell. In such instances, the methods of complementation to produce alphavirus 
replicon particles may involve the introduction of one or more nucleic acids (e.g., RNA, DNA) 
encoding said alphavirus structural protein(s) (e.g.. capsid and/or envelope glycoproteins) into 
10 the immortalized cells, either transiently or stably, and either concurrent with or prior to the 

introduction of the alphavirus replicon RNA. In certain embodiments, the alphavirus repUcon 
RNA is introduced into the cell by transfection an in vitro transcribed RNA. hi other 
embodiments, the alphavirus replicon RNA is introduced into the cell by transfection of a DNA 
(e.g., ELVIS), which is capable of transcribing within the ceU, the replicon RNA. Li yet other 
'.5 embodiments, the alphavirus replicon RNA is introduced into the cell by infection with a seed 
stock of alphavirus replicon particles. In certain embodiments, the nucleic acids encoding said 
alphavirus stmctural protein(s) are defective helper RNA or are DNA that can transcribe within 
the cell defective helper RNAs. 

As discussed herein, "alphavirus RNA replicon vector". "RNA replicon vector", "replicon 

;0 vector" or “repUcon” refers to an RNA molecule that is capable of directing its own 

amplification or self-replication in vivo, within a target cell. To direct its own amplification, the 
RNA molecule should encode the polymerase(s) necessary to catalyze RNA amplification (e.g., 
alphavirus nonstructural proteins nsPl, nsP2, nsP3, nsP4) and also contain cis RNA sequences 
requited for repfication which are recognized and utilized by the encoded polymerase(s). An 
5 alphavirus RNA vector replicon should contain the foUowing ordered elements: 5’ viral or 
cellular sequences required for nonstructural protein-mediated amplification (may also be 
referred to as 5' CSE, or 5’ cis replication sequence, or 5' viral sequences requited in cis for 
replication, or 5’ sequence which is capable of initiating transcription of an alphavirus), 
sequences which, when expressed, code for biologically active alphavirus nonstractural proteins 
3 (e.g., nsPl, nsE*2, nsP3, nsP4), and 3' viral or cellular sequences required for nonstmctural 

protein-mediated amplification (may also be referred as 3’ CSE, or 3' viral sequences required in 
cis for replication, or an alphavirus RNA polymerase recognition sequence). The alphavirus 
RNA vector replicon also should contain a means to express one or more heterologous 
sequence(s), such as for example, an IRES or a viral (e.g., alphaviral) subgenomic promoter 
) (e.g., junction region promoter) which may, in certain embodiments, be modified in order to 

mcrease or reduce viral transcription of the subgenomic fragment, or to decrease homology with 
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defective helper or structural protein expression cassettes, and one or more hetoologous 
sequence(s) to be expressed. Preferably the heterologous sequence(s) comprises a protein- 
encoding ^ne, which is the 3’ proximal gene within the vector replicon. And preferably the 
replicon further comprises a polyadenylate tract 

5 As discussed herein, "recombinant Alphavirus Particle”, “alphavirus replicon particle” and 

“replicon particle" refers to a viiion-like unit containing an alphavirus RNA vector replicon. 
Generally, the recombinant alphavirus particle comprises one or more alphavirus stractural 
proteins, a lipid envelope and an RNA vector replicon. Preferably, the recombinant alphavirus 
particle contains a nucleocapsid stracture that is contained within a host cell-derived lipid 
10 bilayer, such as a plasma membrane, in which one or more alphaviral envelope glycoproteins 
(e.g., E2, El) are embedded. The particle may also contain other components (e.g., targeting 
elements such as biotin, other viral structural protems or portions thereof, hybrid envelopes, or 
other receptor binding ligands), which direct the tropism of the particle from which the 
alphavirus was derived. Generally the interaction between alphavirus RNA and structural 
15 protein(s) necessary to efficiently form a n^licon particle or nucleocapsid may be an RNA- 
protein interactioii between a capsid protein and a packaging signal or packaging serjuence 
contained within the RNA. 

"Alphavirus packaging cell line" refers to a cell which contains one or more alphavirus 
stmctural protein expression cassettes and which produces recombinant alphavirus particles 
10 (replicon particles) after introduction of an alphavirus RNA vector replicon, eukaryotic layered 
vector initiation system, or recombinant alphavims particle. The parental cell may be of 
m a mm alian or non -m a mm alian origin. Within preferred embodiments, the packaging cell line is 
stably transformed with the stractural protein expression cassette(s). 

“Defective helper RNA” refers to an RNA molecule that is capable of being amplified and 
J5 expressing one or more alphavirus stractural proteins within a eukaryotic cell, when that ceD also 
contains functional alphaviras.nonstructural “replicase” proteins. The alphavirus nonstractural 
proteins may be expressed within the ceU by an alphavirus RNA replicon vector or other mi»gT>Q 
To permit amplification and stractural protein expression, mediated by alphavirus nonstractural 
proteins, the defective helper RNA molecule should contain 5’-end and 3’-end RNA sequences 
iO required for amplification, which are recognized and utilized by the nonstractural proteins, as 

well as a means to express one or more alphavirus structural proteins. Hius, an alphavirus 
defective helper RNA should contain the following ordered elements; 5' viral or cell ular 
sequences required for RNA amplification by alphavirus nonstractural proteins (also referred to 
elsewhere as 5’ CSE, or 5' cis replication sequence, or 5’ viral sequences required in cis for 
5 replication, or 5’ sequence which is capable of initiating transcription of an alphavirus), a means 
to express one or more alphavirus structural proteins, gene sequence(s) which, when expressed, 
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codes for one or more alphavirus structural proteins (e.g., C, E2, El), 3’ viral or cellular 
sequences required for amplification by alphavirus nonstructural proteins (also referred to as 3’ 
CSE, or 3' viral sequences required in cis for replication, or an alphavirus RNA polymerase 
recognition sequence), and a preferably a polyadenylate tract Generally, the defective helper 
5 RNA should not itself encode or express in their entirety all four alphavirus nonstructural 
proteins (nsPl, nsP2, nsP3, nsP4), but may encode or express a subset of these proteins or 
portions thereof, or contain sequence(s) derived fiom one or mote nonstmctural protein genes, 
but which by the nature of their inclusion in the defective helper dp. not express nonstmctural 
protein(s) or portions thereof. As a means to express alphavirus stmctural protein(s), the 
10 defective helper RNA may contain a viral (e.g., alphaviral) subgenomic promoter which may, in 
certain embodiments, be modified to modulate transcription of the subgenomic Augment, or to 
decrease homology with replicon RNA, or alternatively some other means to effect expression of 
the alphavirus structural protein (e.g., internal ribosome entry site, ribosomal readthrough 
element). Preferably an alphavims structural protein gene is the 3’ proximal gene within the 
5 defective helper; In addition, it is also preferable that the defective helper RNA does not contain 
sequences that facilitate RNA-protein interactions with alphavims stmctural protein(s) and 
packaging into nucleocapsids, virion-like particles or alphavirus replicon particles. A defective 
helper RNA is one specific embodiment of an alphavims stmctural protein expression cassette. 

Alphavirus for use in the invention may be grown in any one of the cell lines discussed 
0 above as suitable for the SARS virus. 

Alphavirus replicon particles may be produced according to the present invention by using 
the above cell lines (e.g., immortalized cell lines) and a variety of pubUshed and accepted 
alphavims vector methodologies. Such methodologies include, for example, transient packaging 
approaches, such as the co-transfection of in vitro transcribed replicon and defective help«- 
5 RNA(s) (Uljestrom, Bio/Technology 9:1356-1361, 1991; Bredenbeek et al., J. Virol. 57:6439- 
6446, 1993; Frolov etal,J. Virol 71 :2819-2829, 1997; Pushko et al.. Virology 259:389-401, 
1997; US Patents 5,789,245 and 5,842,723) or co-transfection of plasmid DNA-based replicon 
and defective helper constract(s) (Dubensky et al., J. Virol 75:508-519, 1996), as weD as 
introduction of alphavirus stmctural protein expression cassettes (e.g., DNA-based defective 
> helper) into immortalized ceU lines of the present invention to create stable packaging ceU lines 
(PCL) (Polo et al., PNAS 96:4598-4603, 1999; US Patents 5,789,245, 5,842,723, 6 015 694- WO 
97/38087, WO 99/18226, WO 00/61772, and WO 00/39318). Stable packaging cell to may 
then be utilized for alphavims replicon particle production. The PCL may be transfected with in 
vitro transcribed alphavirus replicon RNA, transfected with a plasmid DNA-based lepUcon (e.g., 
ELVIS vector), or infected with a seed stock of alphavirus replicon particles, and then incubated' 
under conditions and for a time sufficient to produce progeny alphavirus replicon particles in the 
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culture supernatant In addition, progeny replicon particles can subsequently be passaged in 
additional cultures of naive PCL by infection, resulting in further expansion and commercial 
scale preparations. Importantly, by using defective helper RNA or stable PCL based on the 
split* structural gene configuration, these replicon particle stocks may be produced fiee fiom 
5 detectable cont amin ating RCV. 

Following harvest, crade culture supernatants containing the chimeric alphavirus replicon 
particles may be clarified by passing the harvest through a filter (e.g., 0.2 uM, 0.45 uM, 0.65 uM, 
0.8 uM pore size). Optionally, the crude supernatants may be subjected to low speed 
centrifugation prior to filtration to remove large cell debris. Within one embodiment, an 
10 endonuclease (e.g., Benzonase, Sigma fflE8263) is added to the preparation of alphavirus replicon 
particles before or after a chromatogr^hic purification step to di^ exogenous nucleic acid. 
Further, the preparation may be concentrated prior to purification using one of any widely known 
methods (e.g., tangential flow filtration). Crude or clarified alphavirus rqplicon particles may be 

concentrated and purified by chromatographic techniques («.g., ion exchange chromatography, 

5 size exclusion chromatography, hydrophobic interaction chromatography, afSnity 

chromatography), such as those described in WOOl/92552, incorporated by reference in its 
entirety herein. Two or more such purification methods may be performed sequentiaUy. 

EXAMPLE OF ALPHAVIRUS REPUCON PARTICLES ENCODING SARS VIRUS SPIKE (S) 
ANTIGEN 

0 The invention includes compositions and methods for the production of replication 

defective viral vector particles (e.g., alphavirus repHcon particles) for use in the ex vivo and in 
VIVO admimstration of heterologous genes encoding proteins having therapeutic or prophylactic 
application, including genes encoding for one or more SARS viral antigens. 

The following example illustrates a method of preparing alphavirus replicon particles 
5 encoding SARS virus spike (s) antigen. 

Ihe SARS virus spike gene can be incorporated into alphavirus replicon particles derived 
from a variety of alphavirus, such as Sindbis virus, SemliM Forest virus (US 5739026), 
Venezuelan equine encephalitis vims (US 6531135), and replicon particle chimeras derived from 
more than one alphavirus (US 6376236, WO 02/99035). In addition, the SARS virus spike gene 
) can be incorporated in its entirety (encoding full-length spike protein) or in a modified form that 
includes, for example, sequence deletions or trancations, such that the encoded a spike protein is 
of less than full-length (e.g., C-terminal truncation of one or more (e.g. at least 1, 2, 3, 4, 5, 6, 7, 

8, 9, 10, 15, 20, 25, 30 etc.) amino acids, deleted of transmembrane region and cytoplasmic tail). 

For example, the spike gene may be cloned as a full-length gene into the VCR-chim2.1 
5 vector (WO 02/99035) by standard RT-PCR conditions or by standard subcloning fiom one of 
the other plasmids described herein, using commerciaDy available restriction endonucleases. For 
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the rever^ transcription step in standard RT-PCR, the Superscript pre-amplification kit 
(Invitrogen™) and the primCT SEQ ID NO: 7325 (sp-RT-R) are used: 

For the amplification step, the cDNA polymerase advantage kit (aonetech) and two 
primers Sp-F-BbvCI (SEQ ID NO: 7326) and Sp-R-Notl (SEQ ID NO: 7327) are used* 

5 The forward primer is designed to contain the ccacc sequence (Kozak, 1991 JBC 19867- 

70) in fiont of the ATG codon to optimize translation efficiency of the spike gene. Also, the 
forward primer contains the BbvCI restriction site and the reverse primer contains the Notl 
restriction site for subsequent cloning of the PCR amplified gene. 

The PCR product is purified using the QIAquick Nucleotide Removal kit (QIAgen), 

10 digested with BbvCI and Notl, gel purified with QIAquick Gel Extraction kit (QIAgen), and 

ligated to plasmid VCR-Chim2.1 pre-digested with the same enzymes. Qones containing the 

SARS spike sequence are verified by sequencing and the new construct is called VCR-Chim2.1- 
SARSspike. 

To generate VEErep/SINenv-SARSspike replicon particles the plasmids VCR-Chim2.1- 
5 SARSspike, VCR-DH-Scap (WO 02/99035), and VCR-DH-Sglydll60 (WO 02/99035) are 
linearized with the restriction enzyme Pmel and used foir in vitro transcription as described 
previously (Polo et al. 1999, PNAS 96: 4598-603; W002/99035). The transcripts ate co- 
transfected into BHK cells as previously described (Polo et al . , 1999, ibid ; W002/99035). The 
transfected cells are incubated at 34 °C, the supernatants collected at 20 and 30 hrs post- 
0 elecfioporation, clarified by centrifugation, and purified by chromatography as previously 
described (WO 01/92552). 

. Expression of the SARS spike protein fixrm the replicon particle vector is verified by 
infecting BHK cells overnight with purified VEErep/SINenv-SARSspike or VEErep/SINenv- 
GFP (WO 02/99035) replicon particles. In addition, BHK cells also were transfected in paraUel 
5 wrth m vitro transcnbed VCR-Chim2. 1-SARSspike replicon RNA. At 16 hrs post-infection and 

transfection cells are lysed and a sample of the lysate analyzed by western blot using an antibody 
that recognizes SARS virus spike protein. The proteins on the gel are stained or transferred to a 
membrane for Western blot analysis with sera from convalescent patients or alternatively murine 
or rabbit antisera generated against SARS virus. VEErep/SINenv-SARSspike replicon particles 
) are administered to the vaccine recipient {e.g., rodent, non-human primate, human) as described 
elsewhere in the present invention. 

Figure 67 shows data fiom western blot analysis p^ormed under non-reducing conditions, 
using a SARS virus specific rabbit polyclonal antisera. The western data demonstrate that not 
only is SARS spike protein expressed in cells infected with alphavirus replicon particles or 
transfected with replicon RNA, but the predominant form of spike is that of a homotrimer 
(Hg.67A). Similar homotrimeric. association of the spike protein was observed in western blots 
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of SARS vinons punfied from SARS viius infected VERO cell supernatants, and this 
homotrimer is heat labile, as indicated by the dissociation into monomedc forms at 80®C and 
100®C (Rg.67B). 

To further characterize SARS Spike protein expression and processing following 
5 expression from alphavirus rq>Iicon vectors, BHK-21 cells were infected with alphavirus 

replicon particles expressing the full-length Spike. At 6 hr post-infection with an MOI of 5, 
infected cells were labeled for 1 hr with E-[^^S]methionine/cysteine and chased for the indicated 
time. The [“S]-labeled spike protein was immunopredpitated by anti-SARS rabbit serum and 
digested with Endo-H. Both digested and undigested proteins were analysed by 4% 

10 polyacrylamide-SDS PAGE under reducing conditions. As shown in Rgure 55, the full-length 
spike protein is synthesized as an Endo-H sensitive high mannose glycoprotein (gpl70, an ER 
form) that undergoes modification to an Endo-H resistant glycoprotein with complex 
oligosaccharides (gpl80, a Golgi form). The conversion of gpl70 into the gpl80 form takes 
place within 2 hr. ■ 

15 Alphavirus replicon particles expressing one or more SARS proteins (e.g., 

VEErep/SINenv-SARSspike replicon particles) are adrmnistered to the vaccine recipient in order 
to induce a SARS specific immune response (e.g„ rodent, ferret, non-human primate, human) as 
described elsewhere in the present invention. Immunization may be performed through a variety 
ofroutBs,includingforexample,intramuscular, subcutaneous, intradermal, and intranasal. In 
20 additon, the alphaviras replicon particles may be used alone or in combination (e.g . , “prime- 
boost”) with other vaccine approaches of the present invention, or alternatively the alphavirus 
replicon particles may co-express antigen from other respiratory pathogens or be co-administered 
in combination with alphavirus replicon particles expressing antig^s from other respiratory 
pathogens (e.g;, influenza virus, parainfluenza virus, respiratory syncytial virus, human 
25 metapneumovirus). For example, the induction of anti-spike protein antibodies in animals 
immunized IM with VEErep/SINenv-SARSspike replicon particles was demonstrated in mice 
OFigure 68). These mouse studies also included addtional vaccine groups for comparison, 
including the inactivated SARS virus and recombinant truncated spike protein vaccines describe 
elsewhere herein, as weU as plasmid DNA used as a prime, followed by alphavirus replicon 
10 particles as a boost The data clearly show very potent immu ne, responses for all vaccine groups, 
including the alphavirus r^licon particle group. It should be noted that the level of antibody 
induced by the inactivated SARS virus vaccine used in these experiments has been shown to be 
protective in a SARS virus animal challenge model. 

Similarly, genes encoding other SARS virus antigens (e.g., nucleoc^sid protein, 

•5 membrane glycoprotein) are cloned into alphavirus replicon vectors, eith^ individually or in 
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combination, to generate alphavirus lepHcon particles according to the teachings of the present 
invention and using standard molecular biology techniques.. 

EXAMPLE OFALPHAVmUS-BASED PLASMID DNA EXPRESSING SARS VIRUS SPIKE 
(S) 

5 The invention includes preparation of plasmid DNA expressing a SARS virus antigen for 

prophylactic or therapeutic immunization against SARS virus infection. In one embodiment, the 

SARS viral antigen is a spike (S) protein. La one embodiment, the plasmid DNA is alphavirus- 
based. 

The following example illustrates one method for preparing an alphavirus-based plasmid 
0 DNA expressing SARS virus spike (S). 

SARS spike gene can be deUveied using any of Ihe alphaviius-based plasmid DNA 

cepHcons such as ELVS (Dubensky et al, 1996 J Viiol. 70: 508-19). SINCP (WO 01/81609), or 
VCP (PCT WO 02/99035). 

For example, the SARS spike gene is cloned into SINCP using the standard RT-PCR 
5 techniques. The oligo Sp-RT-R is used for the reverse transcription step with the Superscript pre- 
amplification kit (Invitrogen). For the amplification step, the cDNA polymerase advantage kit 
(Qonetech) with the Sp-R-Notl and Sp-F-Xhol (SEQ ID NO: 7328) primers is used. • 

The Sp-F-Xhol primm was designed to contain the ccacc sequence in finnt of the ATG 
codon to optimize translation efficiency (Kozak 1991, ibid) of the spike gene. Also, the primer 
) contains the Xhol restriction site for the subsequent cloning of the PCR amplified gene. 

The PCR product is purified using the QIAquick Nucleotide removal kit, digested with 
Xhol and Notl, gel purified with QIAquick Gel Extraction kit, and ligated to plasmid SINCP pre- 
digested with the same enzymes. Clones containing the SARS spike sequence are verified by 
sequencing and the new constmct is called SINCP-SARSspike. 

Expression of the SARS spike gene is verified by transient transfection of BHK cells with 
2ng of either plasmid DNA SINCP-SARSspike or SINCP pre-incubated for 5 minutes with 5 pi 
of TransTT Polyamine reagent (Minus) in low saum medium Optimem (life Technologies). At 
48 hrs pos-transfection cells are lysed and a sample of the lysate is run on 8% SDS-PAGE. The 
proteins on the gel ^ either stained or transferred to a membrane for Western blot analysis with 
sera from convalescent patients, or alternatively with sera fiom mouse or rabbits. 

SINCP-SARSspike plasmid replicon is administered to the vaccine recipient (e.g., rodent, 
non-human primate, human) as a formulated or unformulated plasmid vaccine, alone or in 

combmation (e.g., “prime-boost”) with other vaccines of the present invention, as described 
elsewhere herein. 

Similarly, genes encoding other SARS virus antigens (e.g., nucleocapsid protein, 
membrane glycoprotein) are cloned into alphavirus plasmid replicon vectors. 
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2. Plasmid Expression Vectors 

EXAMPLE OF PLASMID DNA EXPRESSING SAKS VIRUS SPIKE (S) 

Hie following example illustrates a method for preparing plasmid DNA expressing SARS 
virus spike (s). 

5 The SARS virus spike antigen also may be delivered using other plasmid DNA expression 

vectors (sometimes referred to as “conventional” DNA vaccines), based on a polymerase II 
promoter, such as, for example, a CMV promoter. A DNA vaccine of the spike antigen gene 
induces an antibody response in mice ^ao et al. (2004) Acta Biochim et Biophysica Sinica 
36:37-41), and has been found to induce viral neutralization and protective immunity in mice 
10 (Yang et al. (2004) Nature 428:561-564), particularly when truncated at the C-terminus. 

For example, the SARS spike gene is cloned into pCMVKm2 (Zur Megede et al., J. Virol., 
74:2628-2635, 2000; SEQ ID NO: 9923) using standard RT-PCR techniques. Hie oligo Sp-RT- 
R is used for the reverse transcription step with the Superscript pre-amplification kit 
(hivitrogen). For the amplification step, the cDNA polymerase advantage kit (Qonetech) is.used 

15 with primers Sp-E-EcoRI (SEQ ID NO: 7329) and Sp-R-Xbal (SEQ ID NO: 7330). 

The forward primer was designed to contain the CCACC (SEQ ID NO: 7331) sequence in 
front of the ATG codon to optimize translation efficiency (Kozak 1991, ibid.) of the spike gene. 
Also, the forward primer contains the EcoRI restriction site and the reverse primer contains the 
Xbal restriction site for the subsequent cloning of the PCR amplified gene. 

10 The PCR product is purified using the QIAquick Nucleotide Removal kit, digested with 

Xhol and Notl, gel purified with QIAquick Gel Extraction Mt, and ligated to plasmid . 
pCMYKm2 pre-digested with the same enzymes. Clones containing the SARS spike sequence 
are verified by sequencing and the new constmct is called pCMVKm2-SARSspike. 

Expression of the SARS spike gene is verified by transient transfection of BHK or 293 
:5 cells with 2jig of either plasmid DNA pCMVKm2-SARSspike or pCMVKm2 pre-incubated for 

5 minutes with 5 pi of TranslT Polyamine reagent (Mirrus) in low serum medium Optimem (Life 
Technologies). At 48 hrs pos-transfection ceUs are lysed and a sample of the lysate is run on 8 % 
SDS-PAGE. Hie proteins on the gel are either stained or transferred to a membrnne for Western 
blot analysis with sera from convalescent patients, or alternatively using mouse or rabbit 
0 antism. 

Plasmid pCMVKm2-SARSspike is administered to the vaccine recipient (e.g., rodent, non- 
human primate, human) as a formulated or unformulated plasmid vaccine, as described 
elsewhere in the present invention. 

Similarly, genes encoding other SARS virus antigens {e.g., nucleocapsid protein, 

5 membrane glycoprotein) are cloned into plasmid expression vectors 
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3^ Vinis-Uce Particles co mprising S ARS antigens 

The SARS viral antigens of the invention may be formulated into Virus like Particles 
(“VLPs”). The invention thus includes virus-like particles (or VLPs) comprising one or more 
SARS viral antigens. Preferably, the VLPs comprise one or more SARS viral antigens selected 
5 firom the group consisting of Spike (S), nucleocapsid (N), membrane (M) and envelope (E). 

Preferably, the VLPs comprise at least M and E. 

•n» VLPs of the invention comprise at least one particle-fonnmg polypeptide. Said 
pamcle-fonning polypeptide is preferably selected &om a Cotonavirea stmctnral protein. In one 
embodrment. the particle-fotming polypeptide is selected from one or more SARS viral antigens 

10 i-“0*«endmdimeot,theparticle-fo.mingpolypeptideisseleetedSomthestmctn^ . 

of a non^S ARS Coronavirns, such as, for example, Mouse Hepatitis Virus. 

VLPs can Ik formed when viral stractural ptoteitis are expressed in eukaryotic or 

prokaryotic expret^ion systems. Upon expression, the structural proteins self-assemble to form 

particles. Altemahvely, vital structnral proteins may be isolated ftom whole virus and 

5 formulated with phosphoUpids. Such viral structnral proteins are referred to herein as •■particle- 
forming polypeptides-. VLPs are not infectious hecanseno viral genome is present, however, 

these non-replicating, Vitus capsids mimic the structure of native virions. 

Due to their stnanure, VLPs can display a large number of andgemc sites on their surface 

(snmiar to a native virus). VIPs offer an advantage to live or attenuated vaccines in that they are 
3 much safer to both produce and administer, since they are not infectious. VIPs have been 

shown to induce both neuttalixing antibodies as weU as T-cell responses and can be r~.,-n,-r py 
both cl 0 ss I and H MHC pathways. 

Previous work creating VLPs ftom coronavirns indicates that E and M proteins along may 
be sufficient for coronavirus VLP formation. Fischer er al., J. Virol. (1998) 72:7885-7894 
» and Vennema er al. EMBO J. (1996) 15:202(1-2028. 

Chimeric VLPs comprising particle-forming polypeptides or portions thereof ftom non- 
SARS Coronaviruses are also included in the invention. Such parliele-fotming polypeptides may 

compnse a fuB length polypeptide ftom a non-SARS Coronavirus. Altemadvely, a pardcle- 
forming fragment may be used. 



In one embodiment, a fragment of a non-SARS particle-forming polypeptide and a 
fragment of a SARS viral antigen are fused together. For instance, such chimeric polypeptides 
may comprise the the endodomain and transmembrane domain of a non-SARS particle-forming 
polypeptide and the ectodomain of a SARS viral antigen. In one example, the VLPs of the 
invention comprise a chimeric spike protein comprising an endodomain and transmembrane 
domain of the spike protein of Mouse Hepatitis Virus (MHV) and the chimeric spike protein 
finther comprises the ectodomain of the SARS spike protein. Such VLPs may further comprise 
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Coronavirus M and E proteins. Said M and E proteins may be selected from any coronaviius. 
including Mouse Hepatitis Virus (MHV) or SAKS. Sample sequences of S, M and E proteins of 
MHV are included in the figures, supra. 

Chimeric spike proteins derived from the ectodomain of feline infectious peritonitis virus 
5 (PIPV) spike protein fused to the endo and transmembrane domains of MHV spike protein have 
been previously disclosed. See WO 98/49195 and WO 02/092827. In these chimeric VLP 
stractures, the capsid structure of the VLPs is formed by the M and E protein of MHV. The 

chimeric spike protein provides for the surface exposure of the ectodomain of the HPV spike 
protein. 

10 As used herein, the term “virus-like particle” or “VLP” refers to a non-repUcating, empty 

virus shell. VLPs are generaUy composed of one or more viral proteins, such as, but not limited 
to those proteins referred to as capsid, coat, sheU, surface and/or envelope proteins, or particle- 

forming polypeptides derived from these proteins. VLPs can form spontaneously upon 
recombmant expression of the protein in an approrpirate expression system. Alternatively, viral 
.5 stractural proteins may be isolated from whole virus and formulated with phospholipids. 

Methods for producing particular VLPs are known in the art and discussed more fully below. 

The presence of VLPs in a composition can be detected using conventional techniques known in 
the art, such as by electron microscopy, x-ray crystaUography, and the like. See, e.g.. Baker et 
al., Biophys. J. (1991) 60:1445-1456; Hagensee et al., J. Virol (1994) 68:4503-4505. For 
0 example, cryoelectron microscopy can be performed on vitrified aqueous samples of the VLP 
preparation in question, and images recorded under appropriate exposure conditions. 

The phrase “particle-forming polypeptide” includes a full-length or near full-length viral 
protein, as well as a fragment thereof, or a viral protein with internal deletion, which has the 
ability to form VLPs under conditions that favor VLP formation. Accordingly, the polypeptide 
5 may comprise the full-length sequence, fragments, truncated and partial sequences, as well as 

analogs and precursor forms of the reference molecule. The term therefore include deletions, 
additions and substitutions to the sequence, so long as the polypeptide retains the ability to form 
a VLP. Thus, the term includes natural variations of the specified polypeptide since variations in 
coat proteins often occur between viral isolates. The term also includes deletions, addition and 

) substitutions that do not naturally occur in the reference protein, so long as the protein retains the 
ability to form a VLP. 

Preferred substitutions are those which are conservative in nature, i.e., those substitutions 
that take place within a family of amino acids that are related in their side chains. Specifically, 

ammo acids are generaUy divided into four famihes: ( 1 ) acidic: aspartate and glutamate; ( 2 ) 

) basic: lysine, arginine, and histidine; (3) non-polan alanine, vaUne, leucine, isoleucine, proUne, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar glycine, asparagine, glutamine, 
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cystine, seme, theronine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes 
classified as aromatic amino acids. For example, it is reasonably predictable that an isolated 
replacement of leucine witfi isoleucine or valine, an asparate with a glutamate, a threonine with a 
serine, or a similar conservative replacement of an amino acid with a structurally related amino 
5 acid, will not have a major effect on the biological activity. Proteins having substantially the 
same amino add sequence as the reference molecule, but possessing minor aminn add 
substitutions that do not substantially affect the immunogenicity of the protein, ate therefore 
within the definition of the reference polypeptide. 

Ihe VLPs of the invention can be formed from any viral protdn, particle-f orming 
10 polypeptide derived from the vital protein, or combination of viral proteins or fingments thereof, 
that have the capability of forming particles under appropriate conditions. The requirements for 
the particle-forming viral proteins are that if the particle is formed in the cytoplasm of the host 
cell, the protein must be suifidently stable in the host cell in which it is expressed such that 
• formation of virus-lilce structures will result, arid that the polypeptide will automatically 
15 assemble into a virus-like structure in the cell of the recombinant expression system used. If the 
protein is secreted into culture media, conditions can be adjusted such tiiat VLPs will form. 
Furthermore, the particle-forming protein should not be cytotoxic in the expression host and 
should not be able to replicate in the host in which the VLP will be used. 

Preferred particle-forming polypeptides include coronavirus M and E proteins, preferably 
10 SAKS M and E proteins. 

Methods and suitable conditions for forming particles from a wide variety of viral proteins 
are known in the art. VLPs have been produced, for example from proteins derived from 
influenza virus (such as HA or NA), Hepatitis B virus (such as core or capsid proteins). Hepatitis 
E virus, measles virus, Sindbis virus. Rotavirus, Foot-and-Moutii Disease virus. Retrovirus, 

15 Norwalk virus, human Papilloma virus, HIV, RNA-phages, QB-phage (such as coat proteins), 
GA-phage, fi’-phage, AP205 phage, and Ty (such as retiotransposon Ty protein pi). VLPs are 
discussed further in WO 03/024480, WO 03/024481, and Niikura et oL, Virology (2002) 
293:273-280; Lenz eted., J. Immunology (2001) 5246-5355; Pinto, et al., J. Infectious Diseases 
(2003) 1M:327-338; and Gerber et al., J. Virology (2001) 75(l0):4752-4760. 

10 As explained above, VLPs can spontaneously form when the particle-forming polypeptide 

of interest is recombinantly expressed in an appropriate host cell. Ihus, the VLPs for use in the 
present invention may be prq>ared using recombinant techniques, well known in the art Li this 
regard, genes encoding the particle-forming polypeptide in question can be isolated from DNA 
libraries or directly firom cells and tissues containing the same, using known techniques. The 
>5 genes encoding the particle-forming polypeptides can also be produced synthetically, based on 
the known sequences. The nucleotide sequence can be designed with the appropriate codons for 
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the particular amino sequence desired. In general, one will select preferred codons for the 
intended host in which the sequence will be expressed (e.g. human codons for human DNA 
vaccines). Hie complete sequence is generally assembled from overlrqiping oligonucleotides 
prepared by standard methods and assembled into a complete coding sequence. See., e.g.. Edge, 
5 Nature (1981) 292:756; Nambair et oL Science (1984) 223:1299; Jay et al., J. Biol Chem. (1984) 
259:6311. 

Once the coding sequences for the desired particle-forming polypeptides have been 
isolated or synthesized, they can be cloned into any suitable vector or replicon for expression. 
Numerous cloning vectors are known to those of skill in the art, and the selection of an 
10 appropriate cloning vector is a matter of choice. See, generally, Sambrook er oZ. The vector is 
then used to transform an appropriate host cell. Suitable expression systems include, but are not 
limited to, bacterial, mammalian, bacuolvirus/insect, vaccinia, Semliki Forest virus (SFV), yeast, 
and Xenopus expression systems, well known in the art . 

A nmnber of ceU lines suitable for use as host cells for producing the VLPs of the 
15 ■ invention are known in the art Suitable mammalian ceU' lines include, but are not limited to, 
Chinese Hamster Ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey 
kidney cells (COS), human hepatocellular carcinoma cells (e.g.. Hep G2), Madin-Darby bovine 
kidney (“MDBK”) cells, as well as others. Mammalian sources of cells include, but are not 
limited to, human or non-human primate (e.g., MRC-5 (ATCC CCL-171), WI-38 (ATCC CCL- 
20 75), HUH, human embryonic kidney cells (293 cells, typically transformed by sheared 

adenovirus type 5 DNA), VERO cells from monkey kidneys (including, for example COS7 
cells), horse, cow (e.g., MDBK cells), sheep, dog (e.g., MDCK cells from dog kidneys, ATCC 
CCL34 MDCK (NBL2) or MDCK 33016, deposit number DSM ACC 2219 as described in WO 
97/37001), cat, and rodent (e.g„ hamstmr cells such as BHK21-F, HKCC cells, or Chinese 
25 hamster ovary cells (CHO cells)), and may be obtained from a wide variety of developmental 
stages, including for example, adult, neonatal, fetal, and embryo. 

Bacterial hosts suitable for production of VIPs of the invention include E. coli. Bacillus 
subtilis, and Streptoccocus spp. Yeast hosts suitable for production of VIPs of the invention 
include Saccharomyces cerevisiae, Candida albicans, Candida maltosa, Hansemda polymorpha, 
10 Kluyveromyces fragilis, Kluyveromyces lactis, Pichia guUlerimondii, Pichia pastoris, 

Schizosaccharomyces pombe and Yarrowia lipolytica. Insect cells suitable for production of 
VIPs of the invention (i.e., via baculovirus expression vectors) include Aedes aegypti, 
Autographa califomica, Botnbyx mori. Drosophila tnelanogaster, Spodpterafrugiperda, and 
Trichoplusia ni. 

15 Viral vectors can be used for the production of particles in eukaryotic cells, such as those 

derived from the pox family of viruses, including vaccinia virus and avian poxvirus. Additional, 
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vaccima based mfection/tiaDsfectioD systems, such as those as described in Tomei etaL,J. Virol 
(1993) 67:4017-4026 and Selby et oL, J. Gen. Virol. (1993) 74:1103-1113, can also be used to 
generate the VLPs of the invention, hi this system, cells are first transfected in vitro witii a 
vaccinia virus recombinant that encodes the bacteriophage T7 RNA polymerase. This 
5 polymerase only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with die DNA of interest, driven by a T7 promoter. The polymerase expressed in the 
cytoplasm from the vaccinia virus recombinant transcribes the transfected DNA into RNA which 
is then translated into protein by the host translation machinery. The method provides for hi^ 
level, transient, cytoplasmic production of large quantities of RNA and its translation products. 

10 Depending on the expression system and host selected, the VLPs are produced by growing 

host cells transformed by an expression vector under conditions whereby the particle-forming 
polypeptide is expressed and VLPs can be formed. The selection of the appropriate growth 
conditions is within the skill of the art. If the VLPs are formed intracellularly, the cells are then 
disrupted, usiiig chemical, physical or mechanical means, which lyse the cells yet keqp the VLPs 
.5 substantially intact. Such methods are known the those of skiU in the art and are described in,- 
e.g.. Protein Purification Applications: A Practical Approach, (E.L.V. Harris and S. Angal, 

Eds., 1990). 

The particles are then isolated using methods that preserve the integrity thereof, such as by 
gradient centrifugation, e.g., cesium chloride (CsCl) and sucrose gradients, and the like (see, e.g., 
'0 Kimbauer etal.,J. Virol. (1993) 67:6929-6936), ion exchange chromatography (including anion 
'exchange chromatography such as DMAB and TMAE), hydroxyapatitie chromatography (see 
WO 00/09671), hydrophobic interaction chromatography, gel filtration chromatography and 
other filtration methods such as nanometric filtration and ultrafiltration. Preferably at least one 
anion exchange stq> is performed during purification, and more preferably at least two anion 
S exchange steps are used. 

VLP formulations of the invention may be further processed by methods known in the art 
to disassemble the VLPs into smaller, protein containing moieties using a high concentration of 
reducing agent, followed by reassembly of the VLPs by either removal of the reducing agent or 
by addition of excess oxidant The resulting reassembled VLPs ma y have improved 
0 homogeneity, stability and immunogenic properties. In addition, further ther^utic or 

prophylactic agents may be formulated into the VLPs upon reassembly. See McCarthy et oL, J. 
Virology (1998) 72(1):32^1. See also WO 99/13056 and WO 01/42780. Reducing agents 
suitable for use in VLP disassembly include sulfhydryl reducing agents (such as glutathion, beta 
mercaptoethanol, dithiothreitol, dithioerythritol, cysteine, hydrogen sulfide and mixtures thereof) 

5 preferably contained in moderate to low ionic strength buffers. Sufficient exposure time of the 
VLPs to the reducing agent will be required to achieve a suitable amount of VLP disassembly. 

-154- 




wo 2004/092360 



PCT/DS2004/011710 



Adjuvants may be added to the VLPs of the invention to enhance the inomunogenicity of 
the SARS viral antigens. Antigens suitable for use with VLPs include those described, supra. 
For example, the VLPs of the invention may be adsorbed onto an al uminum adjuvant 
The VLPs of the invention may formulated to enhance their stability. Additional 
5 components which may enhance the stability of a VLP formulation include salts, buffers, non- 
ionic surfactants and other stabilizers such as polymeric polyanion stabilizers. See WO 
00/45841. 

The ionic strength of a solution comprising VLP particles may be main tained by the 
presence of salts. Almost any salt which can contribute to the control of the ionic strength may 
.0 be used. Preferred salts which can be used to adjust ionic strength include physiologically 
acceptable salts such as NaCl, KQ. Na 2 S 04 , ( 1 ^) 2804 , sodium phosphate and sodium citrate. 
Preferably, the salt component is present in concentrations of from about 0.10 M to 1 M. Very 
high concentrations are not preferred due to the practical limitations of parenteral injection of 
high salt concentrations. Instead, more moderate salt concentrations, such as more physiological 
5 concentrations of about 0.15M to about 0.5M with 0.15M-0.32M NaQ are preferred. 

Buffers may also be used to enhance the stability of the VLP formulations of the invention. 
Preferably, the buffer optimizes the VLP stability while maintainirig the pH range so that the 
vaccine formulation will not be irritating to the recipient. Buffers preferably maintain the pH of 
the vaccine formulation within a range of p/H 5.5-7.0, more preferably 6.0-6.5. Buffers suitable 
0 for vaccine formulations are known in the art and include, for exarriple, histidine and imidazole. 
Preferably, the concentration of tile buffer will range from about 2mM to about 100 mM, more . 
preferably 5 mM to about 20 mM. Phosphate containing buffers are generally not preferred 
when the VLP is adsorbed or otherwise formulated with an aluminum compound. 

Non-ionic surfactants may be used to enchance the stability of the VLP formulations of the 

5 mvention. Surfactants suitable for use in vaccine formulations are known in the art and include, 

for example, polyoxyethylene sorbital fatty acid esters (Polysorbates) such as Polysorbate 80 
(e.g., TWEEN 80), Polysorbate 20 (e.g., TWEEN 20 ), polyoxyethylene alkyl ethers (e.g., Brij 
35, Brij 58), as weU as others, including Triton X-100, Triton X-114, NP-40, Span 85 and the 
Pluronic series of non-ionic surfactants (e.g., Pluronic 121 ). The surfactant is preferably present 
) in a concentration of from about 0.0005% to about 0 . 5 % (wt/vol). 

Polymeric polyanion stabilizers may also be used to enchance the stability of the VLP 
formulations of the invention. Suitable polymeric polyanionic stabilizers for use in the invention 
comprise either a single long chain or multiple cross Unked chains; either type possessing 
multiple negative charges along the chains when in solution. Examples of suitable polyanionic 

> polymers include proteins, polyanions, peptides and polynucelic acids. Specific examples 

include caiboxymethyl cellulose, heparin, polyamino adds (such as poly(Glu), poly(Asp), and 
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Poly (GIu, Phe), oxidized glutathione, polynuceltodies, RNA, DNA and serum albumins. Hie 
concentration of the polmeiic polyanion stabilizers is preferably fiurn about 0.01% to about 
0.5%, particularly about 0.05-0.1% (by weigjit). 

G. Passive Immu nization via Antibodies to the SARS Antigens of the Invention 
5 The invention includes antibodies specific to the SARS antigens of the invention and 

methods of treatment or prevention of SARS virus related disease by administrating an effective 
amount of SARS antibodies to a mammalian subject Antibodies specific the SARS antigens can 
be produced by one skilled in the art. Preferably, the antibodies are specific to the spike (S) 
protein of the SARS virus. Potent neutralization of the SARS corona virus using a human 
10 monoclonal anti-spike antibody has been reported (Sui et al. (2004) PNAS USA 101:2536-2541). 

A IgGl form of the monoclonal antibody showed a higher affinity (1.59 nM) than a scFv form 
(32.3nM). 

The antibodies of the invention are specific and selective to SARS antigens. 

In one embodiment, the antibodies of the invention are gener^ed by administering a SARS 
15 antigen to an animal. The method may also include isolating the antibodies fiom the animal. 

Hie antibodies of the invention may be polyclonal or monoclonal antibody preparations, 
monospecific antisera, human antibodies, or may be hybrid or chimeric antibodies', such as 

humanized antibodies, altered antibodies (Fab ’)2 fragments, F(ab) fragments, Fv Augments, 

single-domain antibodies, dimeric or trimeric antibody fragments or constructs, minibodies, or 
10 functional fragments thereof which bind to the antigen in question. 

Antibodies are produced using techniques well known to those of skill in the art and 

disclosed in, for example, US Patent Nos. 4,01 1,308; 4,722,890; 4,016,043; 3,876,504; 
3,770,380; and 4372,745. For example, polyclonal antibodies are generated by immunizing a 
suitable a nim a l , such as a mouse, rat, rabbit, sheep, or goat, with an antigen of interest. In order 
55 to enhance immunogenicity, the antigen can be linked to a carrier prior to imrhunization. Such 
earners are well known to those of ordinary skUl in the art. hnmunization is generally performed 
by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund’s 
complete adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously 
or intramuscularly). The animal is genmally boosted 2-6 weeks latCT with one or more injections 
0 of the antigen in saline, preferably using Freund’s incomplete adjuvant Antibodies may also be 
generated by in vitro immunization, using methods known in the art Polyclonal antiserum is 
then obtained from the immunized anim al 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
(1975) Nature ^6:495-497 , or a modification thereof. Typically, a mouse or rat is immimj/^xl as 
5 described above. Rabbits may also be used. However, rather than bleeding the animal to extract 

serum, the spleen (and optionally several large lymph nodes) is removed and dissociated into 
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single cells. If desired, the spleen cells may be screened (after removal of non-specifically 
adhaent cells) by applying a cell suspension to a plate or well coated with the antigen. B-cells, 
expressing membrane-bound immunoglobulin specific for the antigen, will bind to the plate, and 
are not rinsed away with the rest of the suspension. Resulting B-cells, or all disynni a ted spleen 
5 cells, are then induced to fuse widi myeloma ceUs to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, “HAr*). The resulting 
hybridomas are plated by limitin g dilution, and are assayed for the production of antibodies 
which bind specificaUy to the i mmunizin g antigen (and which do not bind to unrelated antigens). 
The selected monoclonal antibody-secreting hybridomas are then cultured either in vitro (e.g., in 
10 tissue culture bottles or hollow fiber reactors), or in vivo (e.g., as ascites in mice). 

Humanized and chimeric antibodies are also useful in the invention. Hybrid (chimeric) 
antibody molecules are generally discussed in Winter et ed. (1991) Nature 349: 293-299 and US 
Patent No. 4,816,567. Humanized antibody molecules are generally discussed in Riechmarm et 
al. (1988) Nature 332:323-327; Verhoeyan et al (1988) Science 239:1534-1536: and U.K 
15 Patent Publication No. GB 2,276,169, published 21 September 1994). One approach to 

engineeiing a humanized antibody involves cloning recombinant DNA containing the promoter, 
leader, and variable-region sequences from a mouse antibody gene and the constant-region exons 
from a human antibody gene to create a mouse-human chimera, a humanized antibody. See 
, generally, Kuby, “Immunology, 3”* Edition”, W.H. Freeman and Company, New York (1998) at 
10 page 136. 

Antibody fragments which retain the ability to recognize a SARS antigen are also included ; 
within the scope of the invention. A number of antibody fragments are known in the art which 
comprise antigen-binding sites capable of exhibiting immunological binding properties of an 
intact antibody molecule. For example, functional antibody fragments can be produced by 
15 cleaving a constant region, not responsible for antigen binding, from the antibody molecule, 

using e.g., pepsin, to produce F(ab ’)2 fragments. These fragments will contain two antigen 
binding sites, but lack a portion of the constant region from each of the heavy chains. Similarly, 
if desired. Fab fragments, comprismg a single antigen binding site, cm be produced, e.g., by 
digestion of polyclonal or monclonal antibodies with papain. Functional fragmwite, includnig 
10 only the variable regions of the heavy and light chains, can also be produced, using standard 
techniques such as recombinant production or preferential proteolytic cleavage of 
immunoglobulm molecules. These fragments are known as Fy. See, e.g.. Inbar et al (1972) 

Proc. Nat. Acad. Sci USA ^:2659-2662; Hochman et al. (1976) Biochem 15:2706-2710; and 
Ehrhch et al (1980) Biochem 19:4091^6. 

•5 A single-chain Fv (“sFv” or scFv”) polypeptide is a covalently linked Vh-Vl heterodimer 

which is expressed fiom a gene fusion including Vn-and Vl- encoding genes linlrftrf by a peptide- 
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encoding linker. Huston aL (1988) Proc. Nat. Acad. ScL USA M:5879-5883. A number of 
methods have been described to discern and develop chemical stracutres (luikers) for converting 
the naturally aggregated, but chemicaUy separated, light and heavy polypeptide chains fiom an 
antibody V region into an sFv molecule which will fold into a three dimensional structure 
5 substantially similar to the structure of an antigen-binding site. See. e.g., US Patent Nos. 

5,091,513; 5,132,405; and 4,946,778. The sFv molecules may be produced using methods 

described in thea rL See, e.g., Huston et al. (1988) Proc. Nat. Acad. Sci USA M:5879-5338; US 
Patent Nos. 5,091,513; 5,132,405 and 4,946,778. Design criteria include determining the 
appropriate length to span the distance between the C-tenninus of one chain and the N-terminus 
10 of the other, wherein the linker is generaUy formed from smaU hydrophilic amino acid residues 

that do not coil or form secondary structures. Such methods have been described in the art See 
e.g..USPatentNos.5,091.513; 5,132,405 and 4,946,778. Suitable linkers generaUy comprise ’ 
polypeptide chains of alternating sets of glycine and serine residues, and may include glutamic 
acid and lysine residues inserted to enhance solubUity. Anti-spike scFv antibodies have been 

5 reported(Suie/o/. (2004) PNAS USA 101:2536-2541). 

‘TSlini-antibodies” or “nrinibodies” wiU also find use with the present invention. 

Minibodies are sFv polypeptide chains which include oUgomerization domains at their C- 

termim, separated from the sFv by a hinge region. Packer a/., (1992) SiocAem 31:1579-1584. 

The oligomerization domain comprises self-associating a-heUces, e.g., leucine zippers, that can 
0 be further stabiUzed by additional disulfide bonds. The oligomerization domain is designed to be 
compatible with vectorial folding across a membrane, a process thought to facUitate in vivo 
folding of the polypeptide into a functional binding protein. GeneraUy. minibodies are produced 
using recombinant methods weU known in the art. See, e.g.. Pack etal., (1992) Biochem 
3j[:1579-1584; Cumber et al. (1992) J. Inmunolosrv 149B :12Q-12fi 
> Non-conventional means can also be used to generate and identify the antibodies of the 

invention. For exan^le, a phage display Ubrary can be screened for antibodies which bind to the 
S ARS antigens of the invention. See generaUy, Siegel, “Recombinant Monoclonal Antibody 
Technology*. Trari^. Clin. Biol. (2002) 9(1): 15-22; Sidhu, “Phage Display in Pharmaceutical 
Biotechnology”, Curr. Opin. Biotechnol. (2000) U(6):610-616; Sharon, et al, “Recombinant 
» Polyclonal Antibody Dbranes”, Comb. Chem. High Throughput Screen (2000) 3(3): 185-196; 
and Schrmtz et al, “Phage Display: A Molecular Tool for the Generation of Antibodies- 
Review”, P/acenra, (2000) 21 SupplA: S106-12. 

The antibodies of the invention may also be generated by administering the polynucleotide 
sequence encoding for the SARS antigen into an animal. The SARS antigen is then expressed in 
vivo, and antibodies specific to the SARS antigen are generated in vivo. Methods for 
polynucleotide deUvery of the SARS antigens of the invention are discussed in section 4 below. 
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The antibodies of the invention are prefonbly specific to the SARS vinis. 

IL Combinations of one or more of a ny of the above aPDroaches in a v/r/'Wn.. 

The compositions of the invention further comprise combinations of one or more of the 
compositions discussed above. For instance, the invention comprises a composition comprising 
an attenuated SARS virus and a subunit SARS viral antigen. 



. Combinations of SABS anti g ens and other Respiratory Virus 



The invention further relates to vaccine formulations comprising one or more SARS virus 
antigens and one or more other respiratory virus antigens. Additional respiratory virus antigens 
suitable for use in the invention include antigens from influenza virus, human rhinovirus (HRV), 
parainfluenza virus (PIV), respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and 
rhinovirus. The additional respiratory virus antigen could also be from a coronavirus other than 
the SARS coronavirus, such as the NL63 human coronavirus (van der Hoek et al. (2004) Nature 

Medicine 10:36^73). Preferably, the additional respiratory virus antigen is an influenza viral 
antigen. 



5 The invention may also comprise one or more bacterial or viral antigens in combination 

with the SARS viral antigen. Antigens may be used alone or in any combination. (See.:«.g., 

WO 02/00249 describing the use of combinations of bacterial antigens). The combinations may 
include multiple antigens from the same pathogen, multiple antigens from different pathogens or 
multiple antigens from the same and from different pathogens. Thus, bacterial, viral, and/or 
) other antigens may be included in the same composition or may be administered to the same 
subject separately. It is generally preferred that combinations of antigens be used to raise an 
inamune response be used in combinations. 

Non-liiniting examples of bacterial pathogens which may be used in the invention include 

diphtheria {See, e.g.. Chapter 3 of Vaccines, 1998, eds. Plotkin & Mortimer (ISBN 0-7216-1946- 
0), staphylococcus {e.g.. Staphylococcus aureus as described in Kuroda et al. (2001) Lancet 
357:1225-1240), cholera, tuberculosis, C. tetani, also known as tetanus {See. e.g., Ch^ter 4 of 
Vaccines, 1998, eds. Plotkin & Mortimer (ISBN 0-7216-1946-0), Gieup A and Group B 
streptococcus (including Streptococcus pneumoniae. Streptococcus agalactiae and Streptococcus 
pyogenes as described, for example, in Watson et al. (2000) Pediatr. Infect. Dis. J. 19:331-332; 
Rubin ^ al. (2000) Pediatr Gin. North Am. 47:269-284; Jedrzejas etal. {2001) Microbiol Mol ’ 
Biol Rev 65:187-207; Schuchat (1999) lancet 353:51-56; GB patent applications 0026333.5; 
0028727.6; 015640.7; Dale et al. (1999) Infect Dis Clin North Am 13:227-1243; Ferretti et aL 
(2001) PNAS USA 98:4658-4663), pertussis {See. e.g.. Gusttafsson et al. (1996) N. Engl J. Med. 
334:349-355; RappuoU ^ al. (1991) TTBTECH 9:232-238), meningitis. Moraxella catarrhalis 
{See, e.g.. McMichael (2000) Vaccine 19 Suppl. 1:S101-107) and other pathogenic states. 
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includmg, without limitation. Neisseria meningitides (A, B, C, Y), Neisseria gonorrhoeae (See, 
e,g., WO 99/24578; WO 99136544; and WO 99/57280), HeUcobacter pylori (e.g., CagA, VacA, 
NAP, HopX, HopY and/or urease as described, for example, WO 93/18150; WO 99/53310; WO 
98/04702) and Haemophilus influenza. Hemophilus influenza type B (HIB) (See, e.g., 

5 Costantino et al (1999) Vaccine 17:1251-1263), Porphyromonas gingivalis (Ross et oL (2001) 
Vaccine 19:4135-4132) and combinations thereof. 

Non-limiting examples of viral pathogens which may be used in the invention include 
meningitis, ihinovirus, influenza (Kawaoka et al.. Virology (1990) 179:759-767; Webster et al., 
"Antigenic variation among type A influenza viruses," p. 127-168. In: P. Palese and D.W. 

10 Kingsbury (e<L), Genetics of influenza viruses. Spiinger-Verlag, New York), respiratory 

syncytial virus (RSV), parainiflu^a virus (PIV), rotavirus (e.g., VPl, VP2, VP3, VP4, VP6, 

^ VP7, NSPl, NSP2, NSP3, NSP4 or NSP5 and other rotavirus antigens, for example as described 
in WO 00/26380) and the like. Antigens derived from other viruses will also find use in the 
present invention, such as without limitation, proteins fiom members of the families 
15 Picomaviridae (e.g., polioviruses, etc. as described, for example, in Sutter et al. (2000) Pediatr 

Clin NorthAm 47:2S7-30S;Zimmeimm&Sp2am (1999) Am Font Physician 59:li3-llS; 125- 
126); Caliciviridae; Togaviridae (e.g., rubella virus, etc.); Haviviridae, including the genera 
flavivirus (e.g., yellow fever virus, Japanese encephalitis virus, serotypes of Dengue virus, tick 
borne encephalitis virus. West Nile virus, St. Louis encephalitis virus); pestivirus (e.g., classical 
iO porcine fever virus, bovine viral diarrhea vims, border disease virus); and hepacivims (e.g., 

hepatitis A, B and C as described, for example, in US Patent Nos. 4,702,909; 5,01 1,915; 
5,698,390; 6,027,729; and 6,297,048); Parvovirus (e.g., parvovirus B 19); Coronaviridae; 
Reoviridae; Bimaviridae; Rhabodoviridae (e.g., rabies virus, etc. as described for example in 
Dressen et al. (1997) Vaccine 15 Suppl:s2-6; MMWR Morb Mortal Wkly Rep. 1998 Jan 
15 16:47(1): 12, 19); Filoviridae; Paramyxoviridae (e.g. , mumps virus, measles virus, respiratory 

syncytial virus, etc. as desmibed in Chapters 9 to 11 of Vaccines, 1998, eds. Plotkin & Mortimer 
(ISBN 0-7216-1946-0); Orthomyxoviridae (e.g., influenza virus types A, B and C, etc. as 
described in Chapter 19 of Vaccines, 1998, eds. Plotkin & Mortimer (ISBN 0-7216-1946-0),.); 
Bunyaviridae; Arenaviridae; Retroviradae (e.g., HTLV-1; HTLV-11; HIV-1 (also known as 
SO HTLV-in, LAV, ARV, HTIJR, etc.)), including but not limited to antigens from the isolates 
HTVIUb, HIVSF2, fflVLAV, HIVI-AL, I-IIVMN, SF162); HIV- 1 CM235, HTV- 1 US4; HIV-2; 
simian immunodeficiency virus (SIV) among others. Additionally, antigens may also be derived 
from human papilloma virus OHPV) and the tick-borne encephalitis viruses. See, e.g. Virology, 
3”* Edition (W.K. Joklik ed. 1988); Fundamental Virology, 2nd Edition (B.N. Helds and D.M. 

>5 Krnpe, eds, 1991), for a description of these and other viruses. 
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Proteins may also be derived from the herpesvirus family, including proteins derived fiom 
herpes simplex virus (HSV) types 1 and 2, such as HSV-1 and HSV-2 glycoproteins gB, gD and 
antigens derived from varicella zoster virus (VZV), Epstein-Barr virus OEBV) and 
cytomegalovirus (CMV) including CMV gB and gH (See, US Patent No. 4,689,225 and PCX 
5 Publication WO 89/07143); and antigens derived from other human herpesviruses such as HBIV6 
and HHV7. (See, e.g. Chee et aL, Cytomegaloviruses (J.K. McDougall, ed.. Springer- Verlag 
1990) pp. 125-169, for a review of the protein coding contort of cytomegalovirus; McGeoch et 
oL,J. Gen. Virol (1988) ^:1531-1574, for a discussion of the various HSV-1 encoded proteins; 
US Patoit No. 5,171,568 for a discussion of HSV-1 and HSV-2 gB and gD protons and the 
10 genes encoding therefor, Baer et al.. Nature (1984) 310:207-211. for the identification of protein 
coding sequences in an EBV genome; and Davison and Scott, J. Gen. Virol (1986) 62: 1759- 
1816, for a review of VZV). Herpes simplex virus (EKV) rgD2 is a recombinant protein 
produced in genetically engineered Chinese h^ter ovary cells. This protein has the normal 
anchor region truncated, resulting in a glycosylated protein secreted into tissue culture medium. 

15 The gD2 can be purified in the CHO medium to greater than 90 % purity. Human 

immunodeficiency virus (HIV) env-2-3 is a recombinant form of the HIV enveloped protein 
produced in genetically engineered Saccharomyces cerevisae. This protein represents the entire 
protein region of HIV gpl20 but is non-gjycosylated and denatured as purified from the yeast. 
HIV gpl20 is a fully glycosylated, secreted form of gpl20 produced in CHO cells in a fashion 
JO similar to the gD2 above. Additional HSV antigens suitable for use in immunogenic 

compositions are described in PCT Publications WO 85/04587 and WO 88/02634, the disclosures 
of which are incorporated herein by reference in their entirety. Mixtures of gB and ^ antigens, 
which are truncated surface antigens lacking the anchor regions, are particularly preferreid. 

Antigens from the hepatitis family of viruses, including hepatitis A virus (HAV) {See, e.g., 

J5 Bell et al (2000) Pediatr Itrfect Dis. J. 19:1 187-1 188; Iwarson (1995) APMIS 103:321-326), 
hepatitis B virus (HBV) {See, e.g., Gerlich et a/. (1990) Vaccine 8 Suppl:S63-68 & 79-80), 
hepatitis C virus (HCV) {See, e.g., PCT/US88/04125, pubUshed European application number 
318216), the delta hepatitis virus (HDV), hepatitis E virus OHEV) and hepatitis G virus (HGV), 
can also be conveniently used in the techmques described hermn. By way of example, the viral 

0 ^ttonnc sequence of HCV is known, as are methods for obtaining the sequence. See, e.g,. 

International Publication Nos. WO 89/04669; WO 90/11089; and WO 90/14436. Also included 
in the invention are molecular variants of such polypeptides, for example as described in 
PCT/US99/31245; PCT/US99/31273 and PCT/US99/31272. The HCV genome encodes several 
viral proteins, including El (also known as E) and E2 (also known as E2/NS1) and an N-terminal 
5 nucleocapsid protein (termed "core") (see, Houghton et al.. Hepatology (1991) 14:381-388, for a 
discussion of HCV proteins, including El and E2). Similarly, the sequence for the 5-antigen 
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from HDV is known (see. e.g., US Patent No. 5,378,814) and this antigen can also be 
conveniently used in the present composition and methods. AdditionaUy, antigens derived from 
HBV, such as the core antigen, the surface antigen, SAg, as well as the presurface sequences, 
pre-Sl and pre-S2 (formerly called pre-S), as well as combinations of the above, such as 
5 SAg/pre-Sl. SAg/pre-S2. SAg/pre-Sypre^2. and pre-S l/pre-S2. will find use herein. See, e.g., 
"HBV Vaccmes - from the laboratory to license: a case study" in Mackett, M. and Williamson, 
JD., Hiwum Vaccmes and Vaccination, pp. 159-176, for a discussion of HBV structure; and US 
Patent Nos. 4.722.840. 5.098.704. 5.324.513. incorporated herein by reference in their entireties; 
Beamesc/al,/. Virol (1995) ©:6833-6838,Bimbaum era/.,/, Virol (1990) 64-3319-3330- 

:0 andaoueroi./. w (1991)^:5457-5464. Eachoftheseproteins,asweUasa^^^ ’ 

fragments thereof, will find use in the present composition and methods. 

Influenza virus is another example of a virus for which the present invention will be 

particularly useful. SpecificaUy. the envelope glycoproteins HA and NA of influenza A are of 

particularinterestforgeneratinganiinmuneresponse. Numerous HA subtypes of influenza A 

5 have been identified (Kawaoka et al.. Virology (1990) 179:759-767; Webster et al., "Antigenic 

variation amongtype A influenza viruses," p. 127-168. In: P. Palese and D.W. Kingsbury (ed.). 
Genetics of influenza viruses. Springer-Verlag. New York). Thus, proteins derived from any of 

these isolates can also be used in the compositions and methods described herein. 

Non-limiting examples of parasitic antigens include those derived from organisms causing 
3 malaria and Lyme disease. 

The methods of the invention comprise administering an immunogenic composition 
comprising a SAKS viral antigen (including one or more of an inactivated SARS virus, an ■ 
attenuated SARS virus, a spht SARS virus preparation or a recombinant or purified subunit 
formulation of one or more SARS viral antigens) to an animal. The immunogenic compositions 

) the invention can comprise an irnmunologically effective amount of the SARS viral 

antigen. An “immimologically effective amount” is an amount sufficient to allow the . 

to raise an immune response to the SARS antigen. 

The immune response preferably involves the production of antibodies specific to the 
SARS antigen. The amount of antibodies produced will vary depending on several factors 
including the animal used, the presence of an adjuvant, etc. 

The immunogenic compositions of the invention may further comprise one or more 
adjuvants. 

The immunogenic compositions of the invention may be administered mucosally. Suitable 
routes of mucosal adrninislration include oral, intranasal, intragastric, pulmonary, intestinal, 
rectal, ocular and vaginal routes. The immunogenic composition may be adapted for mucosal 

administration. For instance, where the composition is for oral administration, it may be in the 
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foim of tablets or capsules, optionally entesric-coated, liquid, transgwiic plants, etc. Where the 
composition is for intranasal administration, it may be in the form of a nasal spray, nasal drops, 
gel orpowcte. 



5 Suitable routes of parent^ administration include intramuscular (IM), subcutaneous, 

intravenous, intraperitoneal, intradermal, transcutaneous, and transdermal {see e.g.. International 
patent appUcation WO 98/20734) routes, as well as delivery to the interstitial space of a tissue. 
The immunogenic composition may be adapted for parenteral administration, for instance in the 
form of an injectable that may be sterile and pyrogen fiee. 

10 Vaccines of the invention may be administered in conjimction with other 

immunoregulatory agents. In particular, compositions will usually include an adjuvant Preferred 
further ac^uvants include, but are not limited to, one or more of the foUowing ^t forth below: 

A. Mineral Containing Compasitinns 

Miheral containing compositions suitable for use as adjuvants in the invention include 
15 mineral salts, such as aluminium salts, and calcium salts. The invention includes mineral salts 
such as hydroxides (e.g. oxjdiydroxides), phosphates (e.g. hydroxyphoshpates, orthophosphates), 
sulphates, etc. (e.g. see ch^ters 8 & 9 of Vaccine design:the subunit and adjuvant approach 
(1995) PoweU & Newman. ISBN 0-306-44867-X.), or mixtures of different mineral compounds, 
with the compounds taking any suitable form (e.g. gel, crystalline, amorphous, etc.), and with 
to adsorption being preferred. The mineral containing compositions may also be formulated as a 
particle of metal salt See WOOO/23105. 

B. Oil-Emulsions 

Oil-emulsion compositions suitable for use as adjuvants in the invention include squalene- 
water emulsions, such as MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated 
5 into submicron particles using a microfluidizer). See W090/14837. See also, Frey e/ a/., 
“Comparison of the safety, tolerability, and immuno^nicity of a MF59-adjuvanted influenza 
vaccine and a non-adjuvanted influeri^ vaccine in non-elderly adults”. Vaccine (2003) 21:4234- 
4237. ~ 

Particularly preferred adjuvants for use in the compositions are submicron oil-inwater 
0 emulsions. Preferred submicron oil-in-water emulsions for use herein are squalene/water 

emulsions optionally containing varying amounts of MTP-PE, such as a submicron oil-in-water 
emulsion containing 4-5% w/v squalene, 0.25-1.0% w/v Tween 80 ™ (polyoxyelthylenesorbitan 
monooleate), and/or 0.25-1.0% Span 85™ (sorbitan trioleate), and, optionally, N-acetylmuramyl- 

I^alanyl-D-isogluatminyl-I^alanine-2-<r-2'-dipalmitoyI-j!n-glycero-3- 
5 huydroxyphosphophoryloxy)-ethylamine (MTP-PE), for example, the submicron oil-in-water 
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emulsion known as "MF59" (International Publication No. WO 90/14837; US Patent Nos 
6^99,884 and 6,451,325, incorporated herein by reference in thdr entireties; and Ott e/ al., 
"MF59 - Design and Evaluation of a Safe and Potent Ac^uvant for Human Vaccines" in Vaccine 
Design: The Subunit and Adjuvant Approach (Powell, iVLF. and Newman, M.J. eds.) Plenum 
5 Press, New York, 1995, pp. 277-296). MF59 contains 4-5% w/v Squalene (e.g., 4.3%), 0.25- 
0.5% w/v Tween 80™, and 0.5% w/v Span 85™ and optionaUy contains various amounts of 
MTP-PE, formulated into submicron particles using a microfluidizer such as Model 1 lOY 
microfluidizer (Microfluidics, Newton, MA). For example, MTP-PE may be present in an 
amount of about 0-500 pg/dose, more preferably 0-250 pg/dose and most preferably, 0-100 
10 pg/dose. As used herein, the term "MF59-0" refers to the above submicron oil-in-water 

emulsion lacking MTP-PE, while the term MF59-MTP denotes a formulation that contains MTP- 
PE. For instance, "MF59-100" contmns 100 pg MTP-PE per dose, and so on. MF69, another 
submicron oil-in-water emulsion for use herein, contains 4.3% w/v squalene, 0.25% w/v Tween 
80™, and 0.75% w/v Span 85™ and optionaUy MTP-PE. Yet another submicron oU-in-water 
5 emulsion is MF75, also known as S^, containing 10% squalene, 0.4% Twedh 80™, 5% 

pluronic-blocked polymer L121, and thr-MDP, also microfluidized into a submicron emulsion. 

MF75-MTP denotes an MF75 formulation that includes MTP, such as from 100-400 pg MTP-PE 
per dose. 

Submicroh oU-in-water emulsions, methods of making the same and immunostimulating 

0 such as muramyl peptides, for use in the compositions, are described in detail in 

International PubUcation No. WO 901 14837 and US Patent Nos. 6,299,884 and 6,45 1,325, 
incorporated herein by reference in their entireties. 

. Complete Freund’s adjuvant (CFA) and incomplete Freund’s adjuvant (ERA) may also be 
used as adjuvants in the invention. 

5 C. Saponin PtwTrmlartnnR 



) 



\ 



S^nin formulations, may also be used as adjuvants in the invention. Saponins are a 
heterologous group of sterol glycosides and triterpmioid glycosides that are found in the bark, 
leaves, stems, roots and even flowers of a wide range of plant species. Saponin from the bark of 
the Qudlaia saponaria Molina tree have been widely studied as adjuvants. Saponin can also be 
commerciaUy obtained from SmUax omala (sarsapriUa), GypsophUla panicidata (brides veU), 
and Saponaria officumalis (soap root). Saponin adjuvant formulations include purified 
formulations, such as QS21, as well as lipid formulations, such as ISCOMs. 



Saponin compositions have been purified using High Poformance Thin Layer 
Chromatography (HP-LC) and Reversed Phase High Performance Liquid Chromatography (RP- 
HPLC). Specific purified firactions using these techniques have been identified, including QS7, 
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QS17, QS18, QS21, QH-A, QH-B and QH-C. Preferably, the saponin is QS21.- A method of 
production of QS21 is disclosed in US Patent No. 5,057,540. Saponin formulations may also 
comprise a sterol, such as cholesterol (see WO 96/33739). 

Combinations of saponins and cholesterols can be used to form unique particles called 
5 Immunostimulating Complexs (KCOMs). KCOMs typically also include a phospholipid such 
as phosphatidylethanolamine or phosphatidylcholine. Any known saponin can be used in 

ISCOMs. Preferably, the ISCOM includes one or more of Quil A, QHA and QHC. ISCOMsare 

further described in EP 0 109 942, WO 96/11711 and WO 96/33739. Optionally, the ISCOMS 
may be devoid of additional detergent See WOOO/07621. 

10 A review of the development of saponin based adjuvants can be found at Barr, et aL, 

“ISCOMs and other saponin based adjuvants”. Advanced Drug Delivery Reviews (1998) 
32:247-271. See also Sjolanda:, et al., “Uptake and adjuvant activity of orally deUyered saponin 
and ISCOM ya<xines”. Advanced Dreg Delivery Reviews (1998) 32:321-338. 

D. Bacterial or Microbial Derivativftg 

.5 Adjuvants suitable for use in the invention include bacterial or microbial derivatives such 

as: ^ 

(1) Non-toxic^ derivatives of enterobacterial lipopolysaccharide (UPS) 

Such derivatives include Monophosphoryl lipid A (MPL) and 3-0-deacylated MPL 

(3dMPL). 3dMPL is a mixture of 3 De-O-acylated monophosphoryl lipid A with 4, 5 or 6 

0 acylated chains. A preferred “small particle” form of 3 De-O-acylated monophosphoryl Upid A 
is disclosed in ^ .0 689 454. Such “smaU particles” of 3dMPL are smaU enough to be sterile 
filtered through a 0.22 micron membrane (see EP 0 689 454). Other non-toxic LPS derivatives 
include monophosphoryl lipid A mirnics, such as aminoalkyl glucosaminide phosphate 
derivatives e.g. RC-529. See Johnson et al. (1999) Bioorg Med Chem Lett 9:2273-2278. 

5 (2) Upid A Derivatives 

lipid A derivatives include derivatives of lipid A fiom Escherichia coli such as OM-174. 
OM-174 is described for example in Meraldi et al., “OM-174, a New Adjuvant with a Potential 
for Human Use, Induces a Protective Response with Administered with the Synthetic C- 
Traninal Fragment 242-310 from the circumsporozoite protein of Plasmodium berghei”, 

) Vaccme (2003) 21:2485-2491; and Pajak, et al., “The Adjuvant OM-174 induces both the 
migration and maturation of murine dendritic cells in vivo”. Vaccine (2003) 21:836-842. 

(3) Inununostimulatory oligonucleotides 

Immunostimulatory oligonucleotides suitable for use as adjuvants in the invention include 
nucleotide sequences containing a CpG motif (a sequence containing an unmethylated cytosine 
followed by guanosine and linked by a phosphate bond). Bacterial double stranded RNA or 
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oligonucleotides containing palindromic or poly(dG) sequences have also been shown to be 
immunostimulatory. 

Hie CpG’s can include nucleotide modifications/analogs such as phosphorothioate 
modifications and can be double-stranded or single-stranded. Optionally, the guanosine ma y be 
5 replaced with an analog such as 2’-deoxy-7-deazaguanosine. See Kandimalla^ et al., “Divergent 
synthetic nucleotide motif recognition pattern: design and development of potent 
immunomodulatory oligodeoxyiibonucleotide agents with distinct cytokine induction profiles”, 
Nucleic Acids Research (2003) 31(9): 2393-2400; WO 02/26757 and WO 99/62923 for 
examples of possible analog substitutions. The adjuvant effect of CpG oligonucleotides is further 
10 discussed in Kiieg, “CpG motifs: the active ingredient in bacterial extracts?”. Nature Medicine 
(2003) 9(7): 831-835; McCluskie, etal, ‘'Parenteral and mucosal prime-boost immunization 
strategies in mice with hepatitis B surface antigen and CpG DNA”, FEMS Immunology and 
Medical Microbiology (2002) 32:179-185; WO 98/40100; US PatoitNo. 6,207,646; US Patent 
No. 6,239,1 16 and US Patent No. 6,429,199. 

15 The CpG sequence may be directed to TLR9, such as the motif GTCGTT or TTCGTT. See 

Kandimalla, et al., “ToD-Uke receptor 9: modulation of recognition and cytokine induction by 
novel synthetic CpG DNAs”, Biochemical Society Transactions (2003) 31 (part 3): 654-;658. The 
CpG sequence may be specific for inducing a Thl immune response, such as a CpG- A ODh(, or 
it may be more specific for inducing a B cell response, such a CpG-B ODN. CpG-A and CpG-B 
10 ODNs are discussed in Blackwell, et oL, “CpG-A-Liduced Monocyte IFN-gamma-Inducible 

Protein-10 Production is Regulated by Plasmacytoid Dendritic Cell Drived BPN-alpha”, J. 
Immunol. (2003) 170(8):4061^68; Krieg, “From A to Z on CpG”, TRENDS in Immunology 
(2002) 23(2): 64-65 and WO 01/95935. Preferably, the CpG is a CpG-A ODN. 

Preferably, the CpG oligonucleotide is constructed so that the 5’ end is accessible for 
55 receptor recogmtion. Optionally, two CpG oligonucleotide sequences may be attached at their 3’ 
ends to form “immunomers”. See, for exarc^le, Kandimalla, et al., “Secondary stractures in CpG 
oUgonucleotides affect immunostimulatory activity”, BBRC (2003) 306:948-953; gt 

al., “ToU-like receptor 9: modulation of recognition and cytokine induction by novel synthetic 
GpG DNAs”, Biochemical Society Transactions (2003) 310part 3):664-658; Bhagat et al., “CpG 
lO penta- and hexadeoxyribonucleotides as potent immunomodulatory agents” BBRC (2003) 

300:853-861 and WO 03/035836. 

(4) ADP-ribosylating toxins and detoxified derivatives thereof. 

Bacterial ADP-ribosylating toxins and detoxified derivatives thereof may be used as 
adjuvants in the invention. Preferably, the protein is derived from E. coli (ie., E. coli heat labile 
5 enterotoxin "LT), cholera (“CT”), or pertussis (“PT”). The use of detoxified ADP-ribosylating 
toxins as mucosal adjuvants is described in WO 95/17211 and as parenteral adjuvants in WO 
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98/42375. Preferably, the adjuvant is a detoxified LT mutant such as LT-K63, LT-R72, and 
LTR192G. The use of ADP-iibosylating toxins and detoxified deiivaties thereof, particularly 
LT-K63 and LT-R72. as adjuvants can be found in the foUowing references, each of which is 
spedficaUy incorporated by reference herein in their entirety: Beignon, et aL, “The LTR72 
5 Mutant of Heat-I^ile Enterotoxin of Escherichia coli Enahnces the AbiUty of Peptide Antigens 

to Elicit CD4+ T Cells and Secrete Gamma Interferon after CoappUcation onto Bare Skin”, 
Infection and Immunity (2002) 70(6):3012-3019; Pizza, et al., “Mucosal vaccines: non-toxic 
derivatives of LT and CT as mucosal adjuvants”. Vaccine (2001) 19:2534-2541; Pizza, et al 
‘XTK63 and LTR72, two mucosal adjuvants ready for clinical trials” Iht J. Med. Microbiol 
.0 (2000) 290(4-5):455-I61; Scharton-Kersten ^ al., ‘Transcutaneous Immunization with Bacterial 

ADP-Ribosylating Exotoxins, Subunits and Unrelated Adjuvants”, Infection and Immunity 
(2000) ^(9):5306-5313; Ryan et al., “Mutants of Escherichia coli Heat-Labile Toxin Act as 

EffectiveMucosalAdjuvantsforNasalDeliveryofanAcellularPertussis Vaccine: Differential 

Effects of the Nontoxic AB Complex and Enzyme Activity on Thl and Th2 Cefis” Infection and 

5 I«nmunity(1999)S(12):6270-6280;PartidoscraI,‘‘Heat-labileenteroto^ . 

and Its site-directed mutant LTK63 enhance the proliferative and cytotoxic T-ceD responses to • 
intran^y co-immunized synthetic peptides”. Immunol. Lett. (1999) 62(3):209-216; Peppoloni 
et al. , “Mutants of the Escherichia coli heat-labile enterotoxin as safe and strong adjuvmts for . 
intranasal delivery of vaccines”. Vaccines (2003) 2(2):285-293; and Pine et al., (2002) 

) “fiitranasal immunization with influenza vaccine and a detoxified mutant of heat labile • 
enterotoxin from Escherichia coli (LTK63)” J. Control Release (2002) 85(l-3):263-270. 
Numerical reference for amino acid substitutions is preferably based on the alignments of the A 
and B subunits of ADP-ribosylating toxins set forth in Domenighini et al.. Mol. Microbiol 
(1995) 15(6):1 165-1167, specificaUy incorporated herein by reference in its entirety. 

' E- Human InrnmnnmodiilatQis 

Human immunomodulators suitable for use as adjuvants in the invention include cytokines 
such as interleukins (e.g. BUI. IL-2. IL4, IL-5, 11^, IL-7. BL-12, etc.), interferons (e.g. 
interferon-y), macrophage colony stimulating factor, and tumor necrosis factor. 

F. Bioadhesives and Mucoadhesi yes 

Bioadhesives and mucoadhesives may also be used as adjuvants in the invention. Suitable 
bioadhesives include esterified hyaluronic add microspheres (Singh et al. (2001) J. Cant. Rele. 
70:267-276) or mucoadhesives such as cross-linked derivatives of poly(acrylic acid), polyvinyl 
alcohol, polyvinyl pyrollidone. polysaccharides and carboxymethylcellulose. Chitosan and 
derivatives thereof may also be used as adjuvants in the invention. E.g. , WO99/27960 
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G. Microparticles 

Microparticles may also be used as adjuvants in the invention. Microparticles {Le. a 
particje of -lOOnm to ~150/«n in diameter, more preferably ~200nm to ~30/im in diameter, and 
most prefaably ~500nm to ~10/im in diameter) formed from materials that are biodegradable 
5 and non-toxic {e.g. a poly(a-hydroxy arid), a polyhydroxybutyiic arid, a polyorthoester, a 
polyanhydiide. a polycaprolactone. etc.), with polyaactideKX>-glycohde) are preferred, ’ 
optionally treated to have a negatively-charged surface (e.g. with SDS) or a positively-charged 
surface (e.g. with a cationic detergent, such as CTAB). 

H. LiDosome.s 

.0 Examples of liposome formulations suitable for use as adjuvants are described in US 

Patent No. 6,090,406, US Patent No. 5,916.588, and EP 0 626 169. 

L_ PolYoxYethYtene ether and Polyoxye t hylene Ester Pnrmnlatinnc 

Adjuvants suitable for use in the invention include polyoxyethylene ethers and 
polyoxyethylene esters. W099/52549. Such formulations further include polyoxyethylene 

5 ®o^*tan ester surfactants in combination with an octoxynol (W001^1207) as well as 

polyoxyethylene alkyl ethers or ester surfactants in combination with at least one additional non- . 
ionic surfactant such as an octoxynol (WOOl/21152). 

Preferred polyoxyethylene ethers are selected from the foUowing group: polyoxyethylene- 

9-lauryl ether Gaureth 9), polyoxyethylene-9-stBoryl ether, polyoxytheylene-8-steoryl ether. 

) polyoxyethylene-4-lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl 
ether. 

J. PolvDhosphaTftne (PCPP't 

PCPP formulations are described, for example, in Andrianov et a/., “Preparation of 
hydrogel microspheres by coacervation of aqueous polyphophazene solutions”. Biomaterials 
(1998) 19(1-3):109-115 and Payne et al., “Protein Release from Polyphosphazene Matrices”, 

Adv. Drag. Delivery Review (1998) ^(3);185-196. 

K. Muramvl rientide-s 

Examples of muramyl peptides suitable for use as adjuvants in the invention include N- 

acetyl-muramyl-L-thieonyl-D-isoglutamine (thr-MDP), N-acetyl-nomiuramyl-L-alanyl-D- 
isoglutamine (nor-MDP). andN-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(r-2'- 
dipalimtoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamineMTP-PE). 

L. Imidaznqni nolone Compounds. 

Examples of imidazoquinolone compounds suitable for use adjuvants in the invention 
mclude Imiquamod and its homologues, described further in Stanley, “Imiquimod and the 
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imidazoqumolones: mechanism of action and ther^utic potential” Clin Exp Dennatol (2002) 
27(7):571-577 and Jones, “Resiquimod 3M”, Curr Opin Livestig Dmgs (2003) 4(2):214-218. 

M. Virosomes and Virus Like Particles (VLPsl 

Vnosomes and Vims Lilce Particles (VU*s) can also be used as adjuvants in the invention. 
5 These stmctures generally contain one or more proteins from a virus optionaUy combined or 

formulated with a phospholipid. They are generally non-pathogenic, non-replicating and 
generally do not contain any of the native viral genome. The viral proteins ma y be recombinantly 
produced or isolated from whole viruses. Hiese viral proteins suitable for use in virosomes or 
VLPs include proteins derived from influenza virus (such as HA or NA), Hepatitis B virus (such 
10 as core or capsid proteins). Hepatitis E virus, measles virus, Sindbis virus. Rotavirus, Foot-and- 

Mouth Etisease virus. Retrovirus, Norwalk vims, human P^illoma virus, HTV, RNA-phages, 
Q6-phage (such as coat proteins), GA-phage, fr-phage, AP205 phage, and Ty (such as 
retrotransposon Ty protein pi). YLPsarediscussedfurthrarin WO 03/024480, WO 03/024481, 
and Niil^ et al., “Chimaric Recombinant Hepatitis E yims-like. Particles as an Oral Vaccine 
15 Vehicle Presenting Foreign Epitopes”, Virology (2002) 2^:273-280; Lenz et al., 

“Papillomaiivurs-Like Particles Induce Acute Activation of Dendritic Cells”, Journal of 

Immunology (2001) 5246-5355; Pinto, eta;/., “Cellular Immune Responses to Human 

Papillomavirus (HPV)-16 LI Healthy Volunteers Immunized with Recombinant HPV-16 LI 
Virus-like Particles”, Journal of Infectious Diseases (2003) 188:327-338: and Gerber et oL, 

to ‘Human Papillomavrisu Virus-like Particles Are Efficient Oral Immunogens- when 

Coadministered with Escherichia coli Heat-Labile Entertoxin Mutant R192G or CpG”, Journal 
of Virology (2001) 75(10):4752-4760. Virosomes are discussed further in, for example, Gluck 
et al., “New Technology Platforms in the Development of Vaccines for the Future”, Vaccine 
(2002)20:B10-B16. 

'■5 TTie invention may also comprise combinations of aspects of one or more of the adjuvants 

identified above. For example, the following adjuvant compositions may be used in the 
invention: 

(1) a saponin and an oil-in-water emulsion (W099/11241); 

(2) a s^nin {e.g.., QS21) + a non-toxic LPS derivative (e.g., 3dMPL) (see WO 

0 94/00153); 

(3) a sq>onin (e.g,., QS21) + a non-toxic LPS derivative (e.g., 3dMPL) + a cholesterol; 

(4) a s^onin (e.g. QS21) + 3dMPL + IH2 (optionally + a sterol) (W098/57659); 

(5) combinations of 3dMPL with, for example, QS21 and/or oil-in-water emulsions (See 
European patent ^plications 0835318, 0735898 and 0761231); 
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(6) SAP, containing 10% Squalane, 0.4% Tween 80. 5% pluronic-block polymer L121, 
and thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion. 

(7) RiW™ adjuvant system (RAS), (Ribi Immunochem) containing 2% Squalene, 0.2% 

5 Tween 80, and one or more bacterial cell wall components from the group consisting of 

monophosphoiylipid A (MPL), trehalose dimycolate (TDh^, and cell wall skeleton (CWS), 
preferably MPL + CWS (Detox™); and 

(8) one or more mineral salts (such as an aluminum salt) + a non-toxic derivative of IPS 
(such as 3dPML). 

10 Alumimum salts and MF59 are preferred adjuvants for parenteral immunisation. Mutant 

bacterial toxins are preferred mucosal adjuv^ts. 

As mentioned above, adjuvants suitable for use in the invention may also include one or 
more of the following: 

- E.coli heat-labile enterotoxin (“LT”). or detoxified mutants thereof, such as the or 

.5 R72 rnutants; 

- cholera toxiu (“CTO, or detoxified mutants thereof; 

- microparticles (i.e., a particle of -lOOnm to ~150/im in diameter, more preferably 
~200nm to ~30/tm in diameter, and most preferably ~500nm to ~10/«n in diameter) formed from 

materials that are biodegradable and non-toxic (e.g. a poly(a-hydroxy acid), a 
0 polyhydroxybutyiic acid, a polyorthoester, a polyanhydride, a polycaprolactone etc.); 

- a polyoxyethylene ether or a polyoxyethylene ester (see fiitemational patent application 
WO 99/52549); 

- a polyoxyethylene sorbitan ester surfactant in combination with an octoxynol (see 
International patent appHcation WO 01/21207) or a polyoxyethylene aikyl ether or ester 

5 surfactant in combination with at least one additional non-ionic surfactant such as an octoxynol 
(see Intemational patent application WO 01/21152); 

- chitosan (e.g. Intemational patent application WO 99/27960) 

- an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) and a saponin (see 
International patent application WO 00/62800) 

) - immunostunuiatory double stranded RNA. 

- aluminum compounds (e.g. aluminum hydroxide, aluminum phosphate, aluminum 
hydroxyphosphate, oxyhydroxide, orthophosphate, sulfate etc. (e.g. see chapters 8 & 9 of 
Vaccine design: the subunit and adjuvant (qjroach, eds. Powell & Newman, Plenum Press 1995 
(ISBN 0-306-44867-X) (hereinafter “Vaccine design”), or mixtures of different aluminum 

; compounds, with the compounds taking any suitable form (e.g. gel. crystalline, amorphous etc.), 
and with adsorption being preferred; 
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- MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated into submicron 
particles using a microfluidizer) (see Chapter 10 of Vaccine design; see also International patent 
application WO 90/14837); 

- liposomes (see Chapters 13 and 14 of Vaccine design) ; 

5 ■ KCOMs (see Ch^ter 23 of Vaccine design); 

- SAP, containing 10% Squalane, 0.4% Tween 80, 5% pluroiiic-block polymer L121, and 
thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion (see Chapter 12 of Vaccine design); 

- Ribi™ adjuvant system (RAS), (Ribi hnmunochem) containing 2% Squalene, 0.2% 

10 Tween 80, and one or more bacterial cell wall components from the group consisting of 

monophosphoryUpid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), 

preferably MPL + CWS (Detox™); 

- saponin adjuvants, such as QuilA or QS21 (see Chapter 22 of Vaccate design), also 
known as Stimulon™; 

15 - ISCOMs, which may be devoid of additional detergent (WO 00/07621); 

- complete Freund’s adjuvant (CFA) and incomplete Freund’s adjuvant (IFA); 

- cytokines, such as interleukins (e.g. BL-l, lL-2, H^5, IL-6, 11^7, IL-12, etc.), 

interferons (e.g. interferon-^), macrophage colony stimulating factor, tumor necrosis factor, etc. . . 
(see Chapters 27 Sc 28 of. Vaccine design); 

■0 - monophosphoryl Upid A (MPL) or 3-O-deacylated MPL (3dMPL) (e.g. chapter 21 of . • 

Vaccine. design); 

- combinations of 3dMPL with, for example, QS21 and/or oil-in-water emulsions 
(European patent applications 0835318, 0735898 and 0761231); 

- oligonucleotides comprising CpG motifs (see Krieg (2000) Vaccine, 19:618-622; Krieg 

5 (2001) Curr. Opin. Mol. TTier., 2001, 3:15-24; WO 96/02555, WO 98/16247, WO 98/18810, WO 

98/40100, WO 98/55495, WO 98/37919 and WO 98/52581, etc.) i.e. containing at least one'cG 
dinucleotide, 

- a polyoxyethylene ether or a polyoxyethylene ester (Intemational patent application 
W099/52549); 

) - a polyoxyethylene sorbitan ester surfactant in combination with an octoxynol 

(Intemational patent application WO 01/21207) or a polyoxyethylene alkyl ether or ester 

surfactant in combination with at least one addrtional non-ionic surfactant such as an octoxynol 
(WO 01/21152); 

- an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) and a saponin 

) (WOOO/62800); 
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- an immunostimulant and a particle of metal salt (Intemational patent application 
WOOO/23105); 

- a saponin and an oil-in-watCT emulsion (WO 99/11241); 

- a saponin (e.g. QS21) + 3dMPL + DL-12 (optionally + a sterol) (WO 98/57659). 

5 Other adjuvants suitable for mucosal or parenteral administration are also available (e.g. 

see chapter 7 of Vaccine design: the subunit and adjuvant aproach, eds. Powell & Newman, 
Plenum Press 1995 (ISBN 0-306-44867-X). 

Mutants of LT are preferred mucosal adjuvants, in particular the “K63” and “R72” mutants 
{e.g. see Intemational patent appUcation WO 98/18928), as these result in an enhanced immune 
LO response. 

Microparticles are also preferred mucosal adjuvants. These are preferably derived ftom a 
poly(a-hydroxy acid), in particular, from a polyGactide) ("PLA"), a copolymer of DJ^lactide 
and gjycolide or glycolic acid, such as a poly(DJL-lactide-co-glycolide) (“PLG” or ‘TLGA”). or 
a copolymer of DJ^lactide and caprolactone. TTie microparticles may be drived from any of 
.5 various polymeric starting materials which have a variety of molecular weights and, in. the case 
of the copolymers such as PLG, a variety of lactide:glycolide ratios, the selection of which wiU 
be largely a matter of choice, depending in part on the coadministered antigen. 

The SARS virus (inactivated or attenuated), viral antigens, antibodies or adjuvants of the 
invention may be entrapped within the microparticles, or may be adsorbed to them. Entrapment 
0 within PLG microparticles is preferred. PLG microparticles are discussed in further detail in 

Morris etal., (1994), Vaccine, 12:5-11, in chapter 13 of Mucosal Vaccines, eds. Kiyono et al.. 
Academic Press 1996 (ISBN 012410587), and in chapters 16 & 18 of Vaccine design: ■ the 

subunit and adjuvant aproach, eds. Powell & Newman, Plenum Press 1995 (ISBN 0-306-44867- ' 
X). 

5 LT mutants may advantageously be used in combination with microparticle-entrapped 

antigen, resulting in significantly enhanced immune responses. 

Alumimum compounds and MF59 are preferred adjuvants for parenteral use. 

The composition may include an antibiotic. 

The immunogenic compositions of the invention may be administered in a single dose, or 
3 as part of an administration regime. The regime may include priming and boosting doses, which 
may be administered mucosaUy, parentCTally, or various combinations tiiereof. 

The methods of the invention further comprise treating or preventing a SARS virus-related 
disease by administering to an animal a composition comprising an effective amount of the 
antibodies of the invention. An “effective amount” of the antibodies of the invention is an 
) amount sufficient to provide passive immunization protection or treatment to the animal 
Preferably, the antibodies of the invention are specific to the SARS viral anti^n. 
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Methods of treatment may combine both immunogenic compositions and antibody 
compositions. Accordingly the invention comprises a method for treating or preventing a SARS 
virus-related disease comprising administering an immtmog^c composition comprising an 
immunologically effective amount of a SARS viral antigen and administoting an effective 
5 amount of antibodies specific to SARS viral antigen. The immunogenic composition and the 

antibodies may be administered togetho: or separately. Hie invention further comprises a 
composition comprising an immunogenic composition comprising an immunologically effective 
amount of a SARS viral antigen and further comprising an effective amount of antibodies 
specific to a SARS viral antigon. 

10 The SARS viral antigens and antibodies of the invention may also be administered in 

polynucleotide fomL The SARS viral antigens and/or antibody proteins are then expressed in 
vivo. 

. The SARS viral antigens and the antibodies of the invention can also be delivered using 
one or more gene vectors, administered via nucleic acid immunization or the lilfe using standard 
15 gene delivery protocols. Methods for gene delivery are known in the art. See, e.g., US Patent 
Nos. 5399346 , 5,580,859, 5,589,466. The constructs can be delivered (e.g., injected) either ' 
subcutaneously, epidermally, intradermally, intramuscularly, intravenous, mucosahy (such as 
nasally, rcctally and vaginally), intraperitoneally, orally or combinations thereof. Intraniuscular 
injection of 25(ig plasmid DNA encoding spike antigens, in 200/tl PBS pH 7.4, at weeks 0, 3 and 
20 . 6, has been described formice by Yang et al. (2004) Nature 428:561-564. 

An exemplary replication-deficient gene delivery vehicle that may be used in the practice 
of the present invention is any of the alphavirus vectors, described in, for example, US Patent 
Nos. 6,342372; 6,329,201 and International Publication WO 01/92552. 

A number of viral based systems have been developed for gene transfer into mammalian 
25 cells. For example, retroviruses provide a convaiient platform for gene delivery systems. 
Selected sequences can be insmted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered to cells of 
the subject either in vivo or ex vivo. A number of retroviral systems have been described (US 
Patent No. 5319,740; Millar & Rosman, BioTechniques (1989) 7:980-990; bfiller, AD, Human 
10 Gene Therapy (1990) 1:5-14; Scarpa et oL, Virology (1991) 180:849-852; Bums et oL, Proc. 
Natl Acad. ScL USA (1993) 90:8033-8037; and Boris-Lawrie & Temin, Cur. Opin. Genet. 
Develop. (1993) 3:102-109. 

A number of adenovirus vectors have also been described. Unlike retroviruses which 
integrate into the host genome, adenoviruses persist extrachromosomally thus mmirnizing the 
15 risks associated with insertional mutagenesis (Haj-Ahmad and Graham, J. Virol (1986) 57:267- 
274; Bett etal,J. Virol (1993) 67:5911-5921; Mittereder etal. Human Gene Therapy (1994) 
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5:717-729; Seth et aL, J. ViroL (1994) 68:933-940; Barr et aL, Gene Therapy (1994) 1:51-58; 
BedmCT, K.L. BioTechruques (1988) 6:616-629; and Rich et aL, Human Gene Thereby (1993) 
4:461-476). Adenoviral delivery of codon-optunsed versions of the genes encoding SARS 
coionavirus structural antigens spike SI, membrane protein and nucleocapsid protein has been 
5 investigated in rhesus macaques and found to invoke a strong neutralizing antibody response 

(Gao etoL (2003) Lancet 362(9399):1895-1896). 

Additionally, various adeno-assodated virus (AAV) vector systems have been developed 
for gene delivery. AAV vectors can be readily constructed using techniques well known in the 
art See, e.g., US Patent Nos. 5,173,414 and 5,139,941; Intemational Publication Nos. WO 
10 92/01070 published 23 January 1992) and WO 93/03769 (published 4 March 1993); Lebkowski 

et aL, Molec. Cell Biol. (1988) 8:3988-3996; Vincent et aL, Vaccines 90 (1990) (Cold Spring 
Harbor Laboratory Press); Carter, B.J. Current Opinion in Biotechnology (1992) 3:533-539; 
Muzyezka, Nv Current Topics in Microbiol and Immunol (1992) 158:97-129; Kotin, R.M. 
Human Gene Therapy (1994) 5:793-801; Shelling and Smith, Gene Therapy (1994) 1:165-169; 
15 andZ3ioueta/.,y.£rp. Med: (1994) 179:1867-1875. • 

Another vector system useful for delivering polynucleotides, mucosaUy and otherwise, is 
the enterically. administered recombinant poxvirus vaccines described by Small, Jr., P.A., et al. ■ 
(US Patent No. 5,676,950, issued October 14, 1997, herein incorporated by reference) as well as 
the vaccinia virus and avian poxviruses. By way of example, vaccinia virus recombinants 
10 expressing the ^nes can be constructed as follows. The DNA encoding the SARS antigen or 
antibody or antibody coding sequence is first inserted into an appropriate vector so that it is 
adjacent to a vaccinia promoter and flanking vaccinia DNA sequences, such as the sequence 
encoding thymidine kinase (TK). This vector is then used to transfect cells that are 
simultaneously infected with vaccirda. Homologous recombination serves to insert the vaccinia ‘ 

t5 promoter plus the gene encoding the coding sequences of interest into the viral genome. Ihe 

resulting TK recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can also be used 
to deliver genes encoding the SARS viral antigens or antibodies of the invention. Recombinant 
■0 avipox Vituses, expressing immunogens from mammalian pathogens, are known to confer 
protective immunity when administered to non-avian species. The use of an avipox vector is 
particularly desirable in human and other mammalian species since members of the avipox genus 
can only productively replicate in susceptible avian species and therefore are not infective in 

mammalian cells. Methods for producing recombinant avipoxviruses are known in the art and 

5 employ genetic recombination, as described above with respect to the production of vaccinia 
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- Viruses. See, WO 91/12882; WO 89/03429; and WO 92/03545. Picomavirus-deiived 
vectors can also be used (See, e.%., US Patent Nos. 5,614,413 and 6,063384). 

Molecular conjugate vectors, such as the adenovirus chimeric vectors described in Michael 
et al., J. Biol. Chem. (1993) 268:6866-6869 and Wagner et al., Proc. Natl Acad, ScL USA 
5 (1992) 89:6099-6103, can also be used for gene delivery. 

A vaccinia based infection/transfection system can be conveniently used to provide for 
inducible, transient expression of the coding sequences of interest (for example, a SARS viral 
antigen or antibody expression cassette) in a host cell. In this system, cells are first infected in 
vitro with a vaccinia virus recombinant that encodes the bacteriophage T7 RNA polymerase. 

10 This polymerase displays exquisite specificity in that it only transcribes tftmplfltp.g bearing T7 
promoters. Following infection, cells are transfected with the polynucleotide of interest, driven 
by a T7 promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA that is then translated into protein by the 
host translational machinery. The method provides for high level, transient, cytoplasmic 
15 production of large quantities of RNA and its translation products. See, e.g. , Elroy-Stein and 
Moss, Proc. Natl Acad. Sci. USA (1990) 87:6743-6747; Fuerst et al., Proc. Natl Acad: ScL USA 
(1986) 83:8122-8126. 

As an alternative approach to infection with vaccinia or avipox virus recombinants, or to 
the delivery of genes using odier viral vectors, an anq)lification system can be used that will lead 
iO to high level expression following introduction into host cells. Specifically, a T7 RNA 

polymerase promoter preceding die coding region for T7 RNA polymaase can be engineered. 
Translation of RNA derived from this template will generate T7 RNA polymerase that in turn 
will transcribe more template. Concomitandy, there will be a cDNA whose expression is under 
the control of the T7 promoter. Thus, some of the T7 RNA polymerase generated fiom 
15 translation of the amplification template RNA will lead to transcription of the desired gene. 

Because some T7 RNA polymerase is required to initiate the amplification, T7 RNA polymerase 
can be introduced into cells along with the template(s) to prime the transcription reaction. The 
polymerase can be introduced as a protmn or on a plasmid encoding the RNA polymerase. For a 
further discussion of T7 systems and their use for transforming cells, see, e.g.. International 
10 Publication No. WO 94/26911; Studier and Moffatt, J. Mol Biol (1986) 189:113-130; Deng and 
Wolff, Gene (1994) 143:245-249; Gao et al., Biochem. Biophys. Res. Commun. (1994) 
200:1201-1206; Gao and Huang, Nuc. Acids Res. (1993) 21:2867-2872; Chen et al, Nuc. Acids 
Res. (1994) 22:2114-2120; and US Patent No. 5,135,855. 

The immunogenic compositions of the invention may further comprise diluents, such as 
•5 water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such as wetting or 
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emulsifying agents, pH buffering substances, and the like may be included in the immunogenic 
composition. 

The immunogenic compositions used in the invention can be administered to an animal 
Animals suitable for use in the methods of die invention include humans and other primates, 

5 including non-human primates such as chimpanzees, and other apes and monkey species; farm 
animals such as cattle, sheep, pigs, goats and horses, domestic animals such as dogs and cats; 
laboratory animals including rodents such as mice, rats and guinea pigs; birds, including 
domestic, wild and game birds such as chickens, turiceys and other gallinaceous birds, ducks, 
geese and the like. Animals suitable for use in the invention can be of any age, including both 
10 adult and newborn. Transgenic a nimals can also be used in the invention. 

Hie i m mu n ogenic compositions of die invention can be used to treat or prevent S ARS 
virus-related diseases. 

The compositions of the invention are preferably pharmaceutically acceptable and 
pharmacologically acceptable. In particulariy, the compositions are preferably not biologicaUy 
5 or otherwise undesirable, le., the material may be administered to an individual in a formulation 

or composition without causing any undesirable biological effects or interacting in a deleterious 
manner with any of the components of the composition in which it is contained. 

Pharmaceutically acceptable -salts can also be used in compositions of the invention, for 
example, mineral salts such as hydrochlorides, hydiobromides, phosphates, or sulfates, as well as 
0 salts of organic acids such as acetates, proprionates, malonates, or benzoates. Especially useful 
protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, 
thyroglobulin, ovalbumin; tetanus toxoid, and other proteins well known to those of skill in the 
art Compositions of the invention can also contain liquids or excipients, such as water, saline, 
glycerol, dextrose, ethanol, or the like, singly or in combination, as well as substances such as 
5 wetting agents, emulsifying agents, or pH buffering agents. Uposomes can also be used as a 
earner for a composition of the invention. 

SARS specific reagents and analytical assays may be used in the manufacture and testing 
of the vaccines of the invention. Such analytical assays include, for example: 1) virus titration 
and plaque assays for quantitation of infectious virus particles, 2) a neutralization assay with 
) constant virus and varying serum dilutions, 3) a two step RT-PCR system (Ught Cycler-Roche) 
for detection of negative strand viral RNA, with the target sequence located within the N gene, 
providing highest possible sensitivity, and 4) ELBA and western blot assays for detection and 
qualification of viral proteins. 

In addition, rabbit polyclonal antisCTum has been generated to obtain antibody reagents 
» (and demonstrate induction of neutralizing antibodies) against the SARS-CoV. A sample 
protocol for generating such reagents is set forth below. The virus is first cultivated in suitable 
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ceU culture, such as Vero cells, and peUeted through a 20% sucrose (w/v) cushion. The pellet is 
then subjected to a glycerol potassium-tartrate gradient for further purification. The virus- 
containing fiaction is then diluted and pelleted by ultracentrifugation. The pellet is then 
dissolved in PBS and the virus is inactivated with C3H4O2 (beta-propiolactone, BPL). Two 
5 rabbits are immunized subcutaneously (SC) on day 0, 14, and 28 with 1x10*' inactivated viral 
particles mixed with IFA as adjuvant Rabbits arc bled on days 0 ^nre-inoculation), 13, 28, and 
35 (1 week after 3rd immunization). Sera obtained from this protocol were tested for their 
reactivity against SARS-CoV proteins in western blots and found to react with the major 
structural proteins spike (S), membrane (M), and nucleocapsid (N). 

10 /, Emersins coronavirus vaccinp.s 

The SARS epidemic has lead to increased awareness of viral infections caused by 
coronaviruses. The vaccines of the invention may be adapted to prevent or treat emerging strains 
of coronavirus, including emerging Strains of’SARS virus. 

The mvention provides a vaccine comprising an inactivated (or kiUed) human coronavirus, 
.5 an'attenuated human coronavirus, a split human coronavirus preparation, or a recombinant or 
purified subumt formulation of one or more antigens from a human coronavirus, wherein the 
human coronavirus is not the SARS coronavirus. Optionally, the human coronavirus is not the 
229E coronavirus. OptionaUy, the human coronavirus is not the OC43 coronavirus. OptionaUy, 

the hurnan coronavirus is not the 1^3 coronavirus. Thus the invention provides a vaccine as ' 

0 defined above, wherein the human coronavirus is not the SARS coronavirus, is not the 229E 

coronavirus, is not the OC43 coronavirus and is not the NL63 coronavirus. Such vaccines are 
useful for prevfcnting and/or treating emerging human coronavirus infections. 

The mvention also provides a vaccine comprising: (a) an inactivated (or killed) human 
coronavirus, an attenuated human coronavirus, a split human coronavirus preparation, or a 
5 recombinant or purified subumt formulation of one or more antigens from a human coronavirus, 

wherein the human coronavirus is not the SARS coronavirus, as defined above; and (b) an 
inactivated (or killed) human coronavirus, an attenuated human coronavirus, a split human 
coronavirus preparation, or a recombinant or purified subunit formulation of one or more 
antigens from a human coronavirus, wherein the human coronavirus is the SARS coronavirus. 

) Such vaccines are useful for preventing and/or treating both SARS and other human 
coronaviruses. 

As well as providing vaccines comprising antigens from more than one type of 
coronavirus, the invention also provides vaccines comprising antigens from more than one strain 
of the same coronavirus e.g. different strains of the SARS coronavirus, or different strains of a 
coronavirus other than the SARS coronavirus. In one embodiment, the vaccine comprises 

antigens from at least two strains of coronavirus, or at least three strains of coronavirus. In one 
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embodiment, the vacdne comprises antigens fipom at least two types of coronavirus. In one 
embodiment, the vaccine comprises at least one antigen from each of the known types of 

coionaviiuses (type I, type II and type ID). Such vaccines follow the model of current influenza 
vaccines. 

5 The selection of coronaviruses and/or coronavirus strains for use in vaccines of the 

invention can be based on various criteria. For instance, selection may be based on viruses and/or 
strains drat have been detected in the geographical region (e.g. northern or southern hemisphere, 
a particular country, etc.) where the vaccine targeted. Selection may be based on the results of 
animal surveillance e.g. of viruses detected in cat populations. Selection may be based on the 
10 results of clinical surveillance e.g. of viruses detected in patients hospitalized with respiratory 
infection. Selection may be performed every year e.g. prior to winter. Vaccines may also be 
administered yearly, again following the model of current influenza vaccines. 

Preferred vaccines are sufficiently immunogenic to provide a neutralizing tmrmim. 
response, and more preferably a protective and/or therapeutic immune, response. Particularly 
.5 preferred vaccines meet the efficacy requirements that may be specified by the WHO fix>m time 

to time. 

A preferred subunit antigen for inclusion in vaccines of the invention is a purified spike 
protein, more preferably in oligomeric (e.g. trimeric) form. The spike protein may or my not be 
cleayed e.g. into its SI and S2 products. 

0 The techniques disclosed above for selecting viruses and/or strains for production of 

vaccines can also be used to select appropriate viruses and/or strains from which HRl and HR2 

sequences can Ite obtained for providing ther^utic peptides, as disclosed above. 

in. DIAGNOSTIC COMPOSITIONS AND METHODS OF THE INVENTION 

The invention provides methods for detecting the SARS coronavirus. Detection in patient 

5 samples can be used to detect and diagnose infections by the virus. Detection in donated blood 

can be used to prevent inadvertent transmission of the virus during blood transplant procedures 
Detection methods fall into three main categories: detection of SARS virus nucleic acids; 
detection of SARS virus proteins; and detection of anti-SARS virus immune responses. The 
invention provides aU such methods. 

3 As used herein when referring to nucleotide sequences, particularly oligonucleotide probes 

and primers, “similar” sequences includes those sequences that are at least 90% identical to 
known SARS V genomic sequence and includes sequences that are at least 95 % identical, at 
least 99% identical and 100% identical to the SARSV genomic sequence over the length’ of the 
probe or primer. 
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As used herein, the term "target nucleic add region" or "target nucleic add" denotes a 
nucldc add molecule with a "target sequence" to be amplified. The target nucldc add may be 
either single-stranded or double-stranded and may include other sequences besides the target 
sequence, which may not be amplified. Hie term "target sequence" refers to the particular 
5 nucleotide sequence of the target nucleic add which is to be amplified. The target sequence may 
include a probe-hybridizing region contained within the target molecule with which a probe will 
form a stable hybrid under desired conditions. The "target sequence" may also include the 
complexing sequences to which the oligonucleotide primers complex and be extended using the 
target sequence as a template. Where the target nucleic add is originaUy single-stranded, the 
10 term target sequence" also refers to the sequence complementary to the "target sequence" as 
present m the target nucleic add. If the "target nucldc add" is originaUy double-stranded, the 
term "target sequence" refers to both the plus (+) and minus (-) strands. 

The tenn "primer" or "oligonucleotide primer" as used herein, refers to an oUgonucleotide 
which acts to initiate synthesis of a complementary DNA strand when placed under conditions in 
5 which synthesis of a primer extension product is induced ie. in the presence of nucleotides and a • 
polymerization-inducing agent such as a DNA or RNA polymerase and at suitable temperature, : 
pH, metal concentration, and salt concentration. The primer is preferably single-stranded for 
maximum efficiency in amplification, but may alternatively be double-stranded. If double- 
stranded, the primer is first treated to separate its strands before being used to prepare extension 
0 products. This denaturation step is typicaUy effected by heat, but may alternatively be carried 
out using alkaU, foUowed by neutralization. Thus, a "primer" is complementary to a template, ■ 
and complexes by hydrogen bonding or hybridization with the template to give a primer/template: 
complex for mitiation of synthesis by a polymerase, which is extended by the addition of 

covalently bonded bases linked at its 3' end complementary to the template in the process of 
5 DNA synthesis. 

As used herein, the term "probe" or "oligonucleotide probe" refers to a structure comprised 
of a polynucleotide, as defined above, that contains a nucleic add sequence complementary to a 
nucleic add sequence present in the target nucleic acid analyte. The polynucleotide regions of 
probes may be composed of DNA, and/or RNA, and/or synthetic nucleotide analogs. When an 
) "oligonucleotide probe" is to be used in a 5' nuclease assay, such as the TaqMan™ technique, the 
probe will contain at least one fluorescer and at least one quencher which is digested by the 5’ 
endonuclease activity of a polymerase used in the reaction in order to detect any amplified target 
oligonucleotide sequences. In this context, the oligonucleotide probe will have a sufficient 
number of phosphodiester linkages adjacent to its 5’ end so that the 5' to 3' nuclease activity 
en^)loyed can effidently degrade the bound probe to separate the fluorescers and quenchers. 
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When an oligonucleotide probe is used in the TMA technique, it will be suitably labeled, as 
described below. 

It will be appreciated that the hybridizing sequences need not have perfect 
complementarity to provide stable hybrids. In many situations, stable hybrids will form where 
5 fewer than about 10% of the bases are mismatches, ignoring loops of four or more nucleotides. 

Accordingly, as used herein the term "complementary" refers to an oligonucleotide that forms a 
stable duplex with its "complement" under assay conditions, generally where there is about 90% 
or greater homology. 

Hie terms "hybridize" and "hybridization" refer to the formation of complexes between 
10 nucleotide sequences which are sufficiently complementary to form complexes via Watson-Crick 
base pairing. Where a primer "hybridizes" with target (template), such complexes (or hybrids) 
are sufficiently stable to serve the priming function required by e.g. the DNA polymerase to 
initiate DNA synthesis. 

■ Stringent hybridization conditions will typically include salt concentrations of less than 
15 about 1 M, more usually less than about 500 mM and preferably less than about 200 mM. 
Hybridization temperatures can be as low as 5°C, but are typically greater than 22°C, more 
typically greater than about 30®C, and preferably in excess of about 2>TC. Longer fingments 
may require higher hybridization temperatures for specific hybridization. Other factors may 
affect the stringency of hybridization, including base composition and length of the 
20 complementary strands, presence of organic solvents and extent of base mismatching, md the 
combination of parameters used is more important than the absolute measure of any one alone. 
Other hybridation conditions which may be controlled include buffer type and concentration, 
solution pH, presence and concentration of blocking reagents to decrease background binding 
such as repeat sequences or blocking protein solutions, detergent type(s) and concentraitions, 

25 molecules such as polymers which increase the relative concentration of the polynucleotides, 
metal ion(s) and their concentration(s), chelator(s) and their concentrations, and other conditions 
known in the art. Less stringent, and/or more physiological, hybridization conditions are used 
where a labeled polynucleotide amplification product cycles on and off a substrate linked to a 
complementary probe polynucleotide during a real-time assay which is monitored during PCSl 
JO amplification such as a molecular beacon assay. Such less stringent hybridization conditions can 

reverse 

transcription orPCR. 

As used herein, a "biological sample" refers to a sample of tissue, cells or fluid i.«s r»l aM 
fiom a subject, that commonly includes antibodies produced by the subject. Typical samples 
J5 include but are not limited to, blood, plasma, serum, fecal matter, urine, bone marrow, bile, 
spinal fluid, lymph fluid, san^les of the skin, secretions of the skin, respiratory, intpstinal, and 

-180- 





wo 2004/092360 



PCT/US2004/011710 



genitpurinary tracts, tears, saliva, sputum, mucous, milk, blood cells, organs, tissues, biopsies 
(e.g. lung, liver, kidney) and also samples of in vitro cell culture constituents including but not 
limited to conditioned media resulting ftom the growth of cells and tissues in culture medium 
e.g. recombinant cells, and cell components. Other samples that may be used for diagnosis 
5 include stool ssniples and nasopharyngeal aspirates. 

The term "antibody" encompasses polyclonal and monoclonal antibody preparations, as 
weU as preparations including hybrid antibodies, altered antibodies, chimeric antibodies and, 
humanized antibodies, as well as: hybrid (chimeric) antibody molecules (see, for e xamp le 
Winter et oL (1991) Nature 349:293-299; and US Patent 4,816,567); F(ab ')2 and F(ab) 

10 fragments; Fv molecules (noncovalent heterodimers, see, for exan^rle. Inbar et al. (1972) Proc 
Natl Acad Sci USA ^:2659-2662; and Ehrlich et al. (1980) Biochem 19:4091-4096); 
single-chain Fv molecules (sFv) {see, e.g., Huston etoL {19ZK) Proc Natl Acad Sci USA 
M-5879-5883); oligobodies; dimeric and trimeric antibody fragm^t constructs; minibodies (see, 
e.g. . Pack et al. (1992) Biochem 31: 1579-1584;Xhnnber et al. (1992) J Immunology 
.5 149B . 120-126); humanized antibody molecules (see^, e.g. , Riechmaim et al. (1988) Nature 

332:323-327; Verhoeyan et al. (1988) Science 239:1534-1536; and UK Patent Publication No. 
GB 2,276,169, published 21 September 1994); and, any functional fragments obtained from such 
molecules, wherein such fragments retain specific-binding properties of the parent antibody 
molecule. 

0 As used herein, the term "monoclonal antibody" refers to an antibody composition having 

a homogeneous antibody population. The term is not limited regarding the species or source of 
the antibody, nor is it intended to be limited by the manner in which it is martp. The term 
encompasses whole immunoglobulins. 

; Methods of making polyclonal and monoclonal antibodies are known in the art. Polyclonal 
5 antibodies are generated by immunizing a suitable animal, such as a mouse, rat, rabbit, sheep or ■ 
goat, with an antigen of interest. In order to enhance immunogenicity, the antigen can be linlrpH . 
to a caniCT pnor to immunization. Suitable carriers are typically large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic adds, polyglycolic adds, 
polymeric amino adds, amino acid copolymers, Hpid aggregates (such as oil droplets or 
) liposomes), and inactive virus particles. Such carriers are well knovwi to those of ordinary skill in 

the art Furthermore, the antigen may be conjugated to a bacterial toxoid, such as toxoid fiom 
diphtheria, tetanus, cholera, etc., in order to enhance the immunogmiidty thereof. 

Rabbits, sheep and goats are preferred for the preparation of polyclonal sera when large 
volumes of sera are desired. These animals are good design choices also because of the 
) availability of labeled anti-rabbit, anti-sheep and anti-goat antibodies. Immunization is generally 
performed by mixing or emulsifying the antigpn in saline, preferably in an adjuvant such as 
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Prcund’s complete adjuvant ("FCA"), and injecting the mixture or emulsion parenteraUy 
(generally subcutaneously or intramuscularly). The animal is generaUy boosted 2-6 weeks later 
with one or mote injections of the antigen in saline, prefeably using Freund's incomplete 
adjuvant ("FIA"). Antibodies may also be generated by in vitro immunization, using methods 
5 known in the art. Polyclonal antisera is then obtained from the iTmtumi^wi animal 

Monoclonal antibodies are generally prepared using die method of Kohler & MUstein 
(1975) Nature 256:495-497, or a modification thereof, as described above. 

Nucleic acid detection methods 

There are many well known methods of amplifying targeted sequences, such as the 
10 polymerase chain reaction (PCR), reverse transcription PCR (RT-PCR), the ligase chain reaction 
(LCR), the strand displacement amplification (SDA), and the nucleic acid sequence-based 
amplification (NASBA), transcription-mediated ampUfication (TMA) to name a few. These 
methods are described gener^y in the following references: (PCR) US Patents 4,683,195, 
4,683,202, and 4,800,159; (RT-PCR) US patent 5,310,652, 5,322,770; (LCR) EP Application 
5 No., 320,308 published Jun. 14, 1989; (SDA) US Pat Nos. 5,270,184, and 5,455,166 and 
"Empirical A^ects of Strand Displacement Amplification" by G. T. Walker in PCR Methods 
and Applications, 3(1): 1-6 (1993), Cold Spring Harbor Laboratory Press; (TMA) US Patent No. 
5,399,491, and (NASBA) "Nucleic Acid Sequence-Based Amplification (NASBA™)" by L. 
Malek et al., Ch. 36 in Methods in Molecular Biology, Vol. 28: Protocols for Nucleic Acid 
0 Analysis by Nonradioactive Probes, 1994 Ed. P. G. Isaac, Humana Press. Inc., Totowa, N.J. PCR 

methods may include variations that permit quantitation of the target sequence, for example, by 
real time PCR analysis (e.g., as described in US patents 5,210,015, 5,487,972, 5,994,056, 
6,171,785 inter alia). (Each of the above references are hereby incorporated by reference). 

One embodiment of the method of the invention for detecting the presence of SARS virus 
5 in a sample comprises providing a sample suspected of containing a SARS virus nucleic acid 
target, amplifying a template sequence contained within said SARS virus nucleic acid target by 
any known technique of nucleic acid amplification, including any of those mentioned herein, 
using the oligonucleotide primers described herein, particularly those primers comprising the kits 
described herein, and detecting the amplified template sequmce, wherein the presence of the 
) amplified tenq)late sequence indicates the presence of SARS virus in said sample. 

Amplification techniques generally involve the use of two primers. Where a target 
sequence is single-stranded, the techniques generally involve a preliminary step in which a 
complementary strand is made in order to give a double-stranded target The two primers 
hybridize to different strands of the double-stranded target and are then extended. Ihe extended 
) products can serve as targets for further rounds of hybridization/extersion. The net effect is to 

amplify a template sequence within the target the 5’ and 3' termini of the template being defined 
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by the locations of the two primers in the target As an alternative, if one or both of the primers 
contains a promoter sequence then the target can be amplified (by transcription) using a RNA 
polymerase (as in TMA). 

The present invention provides methods and kits for amplifying and/or detecting a template 
5 or target sequence in the SARSV viral nucleic add. The mvention provides a kit comprising 
primers for amplifying a template sequence contained within a SARSV nucleic acid target, the 
kit comprising a first primer and a second primer, wherein the first primer comprises a sequence 
substantially complementary to a portion of said template sequence and the second primer 
comprises a sequence substantiaDy complementary to a portion of the complement of said 
.0 template sequence, wherein the sequences within said primers which have substantial 
complementarity define the termini of the template sequence to be am plifiprf 

Kits of the invention may further comprise a probe which is substantially complementary - 
to the template sequence and/or to its compl^ent and which can hybridize thereto. This probe 
can be used in a hybridization technique to detect amplified template, or to isolate (Le. “capture) 

5 the amplified template or the original target nucleic acid. 

Kits of the invention may further comprise primers and/or probes for generating and 
detecting an internal standard, in order to aid quantitative measurements (e.g Fille et al. 1997 
Biotechniques 23:34-36). 

Kits of the invention may further comprise a DNA polymerase, which will generally be a ■ 

3 thermostable DNA polymerase where a non-isothermal amplification process is to be used. The 
Mts may also compose suppHes of dNTPs, a magnesium salt (e.g. MgCl 2 ), buffer solutions, etc. 

Kits of the invention may comprise more than one pair of primers (e.g. for nested 

amplification), and one primer may be common to more than one primer pair. The kit may also 
comprise more tiian one probe. 

5 Oligomer Probes and PrimerR 

In connection with the nucleic add detection methods of the present invention described 
above, oligomers having sequence similarity, or complementarity, to the SARSV genome are 
useful. The SARSV genome sequences mentioned herein may be used to produce probes and 
primers which can be used in assays for the detection of nucleic acids in test samples. The probes 
' may be designed fiom conserved nucleotide regions of the polynucleotides of interest or from 
non-conserved nucleotide regions of the polynucleotide of interest The design of such probes for 
optimization in assays is within the skill of those of ordinary sMU in the art. GeneraUy, nucleic 
add probes are developed from non-conserved or unique regions when maximum specificity is 
desired, and nucleic acid probes are developed from conserved regions when assaying for 
nucleotide regions that are closely related to, for example, different members of a multi-gene 
family or in related species like mouse and man 
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Using as a basis Ae SARSV graome which can be found as described herein, and/or 
preferably conserved regions of Ae SARSV genome, and/or Ae particularly described primfir 
and probe sequences as Asclosed herein, oligomers of approximately 8 nucleotides or more can 
be prepared which hybri Aze wiA Ae positive strand(s) of SARSV RNA or its complement, as 
5 well as to SARSV cDNAs. TTiese oligomers can serve as probes for Ae detection (including 
isolation and/or labeling) of polynucleotides which contain SARSV nucleotide sequences, and/or 
as primers for Ae transcription and/or replication of targeted SARSV sequences. The oligomers 
contain a targeting polynucleotide sequence, which is comprised of nucleotides which are 
complementary to a target SARSV nucleotide sequence; Ae sequence is of sufficient lengA and 
10 complementarity wiA Ae SARSV sequence to form a duplex which has sufficient stability for 
Ae purpose intended. For example, if Ae purpose is Ae isolation, via immobilization, of an 
analyte containing a target SARSV sequence, Ae oligomers would contain a polynucleotide 
region which is of sufficient lengA and complementarity to Ae targeted SARSV sequence to 
afford sufficient duplex stability to immobilize Ae analyte on a solid surface, via its binding to 
15 Ae oligomers, under Ae isolation conAtions. For example, also, if Ae oligomers are to serve as 
primers for Ae transcription and/or replication of target SARSV sequences m an analyte 
polynucleotide, Ae oligomers would contain a polynucleotide region of sufficient lengA and 
complementarity to Ae targeted SARSV sequence to allow Ae polymerizing agent to continue ■ 
replication from Ae primers which are in stable duplex form wi A Ae target sequence, under Ae 
10 polymerizing conAtions. For example, also, if Ae oligomers are to be used as label probes, or 
are to bind to multimers, Ae targeting polynucleotide region would be of sufficient lengA and : 
complementarity to form stable hybrid duplex stractures wiA Ae label probes and/or mAtimers 
to allow detection of Ae duplex. The oligomers may contain a Tnininninn of about 4 contiguous 
nucleotides which are complementary to targeted SARSV sequence; usually Ae oligomers will 
55 contain a minimum of about 8 contiguous nucleotides which are complementary to Ae targeted 
SARSV sequence, and preferably will contain a minimum of about 14, 15, 16, 17, 18, 19, 20,21, 
22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotides and up to about 50, 75, 100, 200 
contiguous nucleotides or more, which are conq>lementary to Ae targeted SARSV sequence. 

Typically, for use in Ae amplification based meAods (for example, PCR, RT-PCR, TMA) 

•0 oligomras wiU be used as primer sets such that one membCT of Ae primer set has sequence 

similarity or complementarity to a more conserved (among coronaviruses) portion of Ae SARSV 
genome and Ae oAer member of Ae primer set has sequence similarity or comjjlementarity to a 
less conserved portion. The primer sets can be used to amplify Ae target region in ways Aat are 
weU known in Ae art Typically, Ae 5' untranslated region (5TJTR) and Ae 3* untranslated 
5 region (3UTR) are among Ae most conserved regions. Figure 8 shows an alignment of Ae 

5UTR of several coronaviruses. Figure 10 shows an alignment of Ae 3X7111 of several 
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coronaviruses. Hgures 9 and 11 show the sequences of preferred primers for amplification of the 
5TJTR and 3UTR, respectively. Other primers and probes can readily be designed based on the 
sequence aUgrunents provided herein. 

The oligomer, however, need not consist only of the sequence which is complementary to 
5 the tar^ted SARSV sequence. It may contain in addition, nucleotide sequences (eg.promoters) 
or other moieties which are suitable for die purposes for which the oligomers are used. Fbr 
example, if the oligomers are used as primers for the amplification of SARSV sequences via, for 
example, PCR, they may contain sequences which, when in duplex, form restriction enzyme sites 
which facilitate the cloning of the amplified sequences. For exarjq)le, also, if the oligomers are to 
10 be used as "capture probes" in hybridization assays, they would contain in addition a binding 
partner which is coupled to the oligomer containing the nucleotide sequence which is ■ 
complementary to the targeted SARSV sequence. Other types of moieities or sequences which 
are useful of which the obgomers may be comprised or coupled to, are those which are known in 
the art to be suitable for a variety of purposes, including the labeling of nucleotide probes. 

.5 Table 4 (SEQ ID NOS: 1021-6020) shows forward and reverse primers that are useful for 

nucleic acid amplification of SARSV for diagnostic and screeningmethods. 

Preferred primers and probes for SARS nucleic acid detection for diagnostic and screening 
are SEQ ID NOS: 7332-7336 (forward primers), SEQ ID NOS: 7337-7341 (reverse primers) and 
SEQ ID NOS: 7342-7352 (probes). These primers and probes are useful for detection of 
0 sequences in the 3' UTR. 

Any of the above forward primes may be used in combination with any of the above 
reverse primers for amplification of SARSV nucleic acid. The amplified product ma y be- 
detected (or captured) with any of the above jprobes. Particularly preferred combinations of 
forward and reverse primers and the probes for detecting die amplified product include: Forward 
5 SEQ ID NO: 7332 with reverse SEQ ID NO: 7337, 7338, 7339 or 7341 and probe SEQ ID NO: 
7342; forward SEQ ID NO: 7333 or 7334 with reverse SEQ ID NO: 7340 and any of probes 
SEQ ID NO: 7343-7351; Forward SEQ ID NO: 7335 and reverse SEQ ID NO: 7340 or 7341 and 
any of probes SEQ ID NO: 7342-7352. Other combinations of forward and reverse primers and 
^propriate probes can readily be determined by those skUled in the art from the above 
0 information. 

Additional preferred primers and probes for SARS nucleic acid detection for diagnostic 
and screening are SEQ ID NOS: 7353-7362 (forward primers), SEQ ID NOS: 7363-7373 
(reverse primers) and SEQ ID NOS: 7374-7385 (probes). The primers and probes are useful for 
detection of sequences in the 5' UTR. 

5 The above primers may be used in combination for amplification of SARSV nucleic acid 

as follows: any of forward primers SEQ ID NO: 7353-7356 with any of reverse primers SEQ ID 
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NO. 7363-7366, 7368 and the amplified product detected (or captured) with probes SEQ ID NO: 
7374; any of forward primers SEQ ID NO: 7357-7362 with any of reverse primers SEQ ID NO: 
7367, 7369-7373 and the amplified products detected (or captured) with any of probes SEQ ID 
NO: 7375-7385. Particularly preferred combmations of forward and reverse primers and probes 
5 are: Forward primers SEQ ID NO: 7353-7356 with any of reverse primers SEQ ID NO: 7363- 

7366 and probes SEQ ID NO: 7374; forward primers SEQ ID NO: 7357-7358 with reverse 
primers SEQ ID NO: 7367, 7369 and probes SEQ ID NO: 7375 or 7376; Forward primers SEQ 
ID NO: 7357-7359 with reverse primers SEQ ID NO: 7367, 7369 or 7370 and probe SEQ ID 
NO: 7375 or 7376. More preferred are combinations of SEQ ID NO: 7353 or 7354 with SEQ ID 
10 NO: 7363 or 7364 and probe SEQ ID NO: 7374. Other combinations of forward and reverse 
primers and appropriate probes can readily be determined by those skilled in the art from the 
above information. A particularly conserved octanucleotide sequence (SEQ ID NO: 7386) occurs 
in the 3T1TR of SARS (approximately 70-80 bases from the 3' end) and of several other 

Coronaviruses that inay be particularly useful in identifying SARSV. Primers including in this 

.5 region are preferably combined with reverse primers from regions of sequence that are more 
specific for SARS. 

In addition to the above, the intergenic sequence aS) that is characteristic of Coronavirus 
has been identified in SARSV (see above). The IS minimally comprises the sequence ACGAAC 
(SEQ ID NO: 7293) which occurs upstream of each open reading frame (ORF) in the viral 
0 genome. The 5TITR which includes the IS is spliced onto the 5’ end of each viral mRNA at or 
adjacent to the site of the IS. Thus, primers comprising thelS or its complement are useful for 
amphf^g viral nucleic acids, including cDNA made from the viral RNAs. The invention thus • 
comprises a set of primers in which one primer comprises ACGAAC (SEQ ID NO: 7293) or its 
complement (SEQ ID NO: 7387) and one primer comprises any appropriate sequence from the 
5 SARS genome, or a complementary sequence. Useful probes for detecting and/or capturing the 
viral RNAs or cDNA made from the viral RNAs may also comprise the IS sequence, or its 
complement, described above. 

One set of primers for amplification of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 6562, 6563, 6564 and 6565. Of these, 6562 & 6564 are sense primers and 6563 & 

) 6565 are antisense primers. Primers SEQ ID NOS: 6562 & 6565 may be used in a first 

amplification, with a second nested amplification being performed using primers SEQ ID NOS: 
6563 & 6564. In some embodiments of the invention, these four primers are excluded. 

One kit for amplification and detection of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 6567 & 6568 as primers, and SEQ ID NO 6566 as a probe (typically labeled e.g. 

• with TAMRA and/or FAM) for the amplified sequence. In some embodiments of the invention, 
these primers and probe are excluded. 
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One kit for amplification and detection of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 7395 & 6568 as primers, and SEQ ID NO 6566 as a probe (typicaUy labeled e.g. 
with TAMRA and/or FAM) for the amplified sequence. In some embo dimen ts of the invention, 
these primers and probe are excluded. 

5 One kit for amplification of SARS sequences, particularly the nucleoctq)sid gene, uses 

SEQ ID NOs 6560 & 6561 as primers. Tn some embodiments of the invention, these primers are 
excluded. 

One kit for amplification of SARS sequences uses SEQ ID NOs 6496, 6497, 6562, 6563, 
6564 & 6565 as primers, hi some embodiments of the invention, diese primms are excluded. 

10 One kit for amplification of SARS sequences uses SEQ ID NOs 6562, 6563, 6564 & 6565 

as primers. In some embodiments of the invention, these primers are excluded. 

One kit for amplification of SARS sequences uses SEQ ID NOs 6500, 6501, 6502 & 650a 
as primers, hi some embodiments of the invmition, these primers are excluded. 

One kit for amplification of SARS sequences uses SEQ ID NOs 6496, 6497, 6500, 6501, 
15 6502, 6503, 6562, 6563, 6564 & 6565 as primers, hi some embodiments of the invention, these 

primers are excluded. 

One kit for amplification and detection of SARS sequences, particularly by realtime (e.g. - 
TaqMan™) PCR, uses SEQ ID NOs 6567 & 6568 as primers, and SEQ ID NO 6566 as a probe 
(typically labeled e.g. with TAMRA and/or FAM) for the amplified sequence. In some 
iO embodiments of the invention, these primers and probe are excluded. 

One kit for amplification and detection of SARS sequences, particularly by realtime (e.g. 
TaqMan™) PCR, uses SEQ ID NOs 7395 & 6568 as primers, and SEQ ID NO 6566 as a probe • 
(typically labeled e.g. with TAMRA and/or FAM) for the amplified sequence. In some 
embodiments of the invention, these primers and probe are excluded. 

15 One kit for amplification and detection of SARS sequences uses SEQ ID NOs 6562, 6565 

and 6568 as primers, and SEQ ID NOs 7396 and 7397 as probes (typically labeled e.g. with 
TAMRA and/or FAM) for the arnplified sequence, hi some embodiments of the invemtion, these 
primers and probe are excluded. 

One kit for amplification and detection of SARS sequences uses an oligonucleotide 
iO con^>rising SEQ ID NO: 9780 as a forward primer, an oligonucleotide com prisin g SEQ ID NO: 

9781 as a reverse primer, and an oligonucleotide comprising SEQ ID NO: 9782 as a probe. 

Preferred sequences for use with RT-PCR and lightCycler analysis include SEQ ID NOs 
6562, 6568, 6565, 7396 & 7397. hi some embodiments of the invention, these primers and probe 
are excluded. 

•5 The preparation of the oligomers is by means known in the art, including, for example, by 

methods which include excision, transcription, or chemical synthesis. The target g^nenres 
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and/or regions of tbe genome which are selected to which the targeting polynucleotides of the 
oligomras are complementary depend upon the purpose. For example, if the goal is to screen for 
the presence of SARSV in biological samples {e.g. blood, respiratory material, liver, lung), the 
preferred oligomers would be used as probes and/or primers, and would hybridize to conserved 
5 regions of the SARSV genome. Some of the conserved regions of the SARSV genome to which 
the oligomers may bind are described herein, for example, 5XJTR and STJTR. 

In the basic nucleic add hybridization assay, single-stranded analyte nucleic acid (either 
DNA or RNA) is hybridized to a nucleic acid probe, and resulting duplexes are detected. The 
probes for SARSV polynucleotides (natural or derived) are a length which allows the detection 
10 of unique viral sequences by hybridization. While 6-8 nucleotides may be a workable length, 
sequences of 10-12 nucleotides are preferred, and about 13, 14, 15, 16, 17, 18, 19, 20, or 21 or 
more nucleotides or more aj^rears optimal. Preferably, these sequences will derive fiom regions 
which lack heterogeneity. These probes can be prepared using routine methods, including 
automated ohgonuclwtide synthetic methods. Among useful probes, for example, are those 

5 derived from less conserved regions of the SARSV genome. Regions of the genome that are 

typically less conserved can be readily ascertained fiom the sequence alignments provided 
herem, as well as by any other well known techniques. A complement to any unique portion of . 
the SARSV genome wiU be satisfactory. For use as probes, complete complementarity is 
desirable, though it may be unnecessary as the length of the fragment is innoased, 

0 For use of such probes as agents to detect the presence of SARSV polynucleotides (for 

example in screening for contaminated blood or for diagnosing infected individuals), the 
biological sample to be analyzed, such as, without limitation, blood, serum, lung, liver; mucous, ■ 
kidney, saliva, or sputum, may be treated, if desired, to extract the nucleic acids contained 
therein. The resulting nucleic acid fiom the sample may be subjected to gel electrophoresis or 
5 other size separation techniques; alternatively, the nucleic acid sample may be dot blotted 
without size separation, hr order to form hybrid duplexes with the targeting sequence of the 
probe, the targeted region of the analyte nucleic acid must be in single stranded form. Where the 
sequence is naturally present in single stranded form, denaturation will not be required. 

However, where the sequence is present in double stranded form, the sequence will be denatured. 

) Denaturation can be carried out by various techniques known in the art. Subsequent to 

daiaturation, the analyte nucleic acid and probe are incubated under conditions which promote 
stable hybrid formation of the target sequence in the probe with the putative targeted sequence in 
the analyte, and the resulting duplexes containing the probe(s) are detected. 

Detection of the resulting duplex, if any, is usuaUy accomplished by the use of labeled 
probes; alternatively, the probe may be unlabeled, but may be detectable by specific binding with 
a ligand which is labeled, either directly or indirecfly. Suitable labels, and methods for labeling 
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probes and ligands are known in the art, and include, for example, radioactive labels which may 
be incorporated by known methods (e.g., nick translation or kinasing), biotin, fluorescent groups, 
chemiluminescent groups (e.g., dioxetanes, particularly triggered dioxetanes), enzymes, 
antibodies, and the like. 

5 Ihe region of the probes which are used to bind to the analyte can be completely 

complementary to the SARSV genome. Therefore, usually high stringency conditions are 
desirable in order to prevent false positives. However, conditions of high stringency should only 
be used if the probes ate complementary to regions of the viral genome which lack 
heterogeneity. The stringency of hybridization is determined by a number of factors during 
10 hybridization and during the washing procedure, including temperature, ionic strength, length of 
time, and concentration of fonnamide. These factors ate outlined in, for example, Maniatis T. 
(1982). 

Variations of this basic scheme which are known in the art, including those which facilitate 
separation of the duplexes to be detected from extraneous mafftrialg and/or which amplify the 
15 signal from the labeled moiety, may also be used. A number of these variations are reviewed in, 
for example: Matthews & Kricka (1988), Analytical Biochemistry 169: 1 ; Landegren et al. 

(1988), Science 242:229j and Mittlin (1989), Clinical Chem. 35:1819. These and the following 
publications describing assay formats are hereby incorporated by reference herein. Probes 
suitable for detecting SARSV in tiiese assays are comprised of sequences which hybridize with ■ 
20 target SARSV polynucleotide sequences to form duplexes with the analyte strand, wherein the 
duplexes are of sufficient stability for detection in the specified assay .syste m 

A suitable variation is, for example, one which is described in US Pat. No. 4,868,105, 
issued Sep. 9, 1989, and in EPO Publication No. 225,807 (published Jun. 16, 1987). These 
publications describe a solution phase nucleic acid hybridization assay in which the analyte 
25 nucleic acid is hybridized to a labeling probe set and to a capturing probe set The probe-analyte 
complex is coupled by hybridization with a solid-supported capture probe that is complementary 
to the capture probe set This permits the analyte nucleic acid to be removed fiom solution as a 
solid phase complex. Having the analyte in the form of a solid phase complex facilitates 
subsequent separation steps in the assay. The labeling probe set is complementary to a labeled 
10 probe that is bound through hybridization to the solid phase/analyte complex. 

The polymerase chain reaction (PCR) is a techmque for amplifying a desired nucleic acid 
sequence (target) contained in a nucldc acid or mixture thereof. In PCR, a pair of primers are 
einployed in excess to hybridize to the complementary strands of the target nucleic acid. The 
primers are each extended by a polymerase using the target nucleic acid as a template.. The 
!5 extension products become target sequences themselves, following dissociation from the original 
target strand. New primers then are hybridized and extended by a polymaase, and the cycle is 

-189- 




wo 2004/092360 



PCT/US2004/011710 



repeated to geometrically increase the number of target sequence molecules. PCR is HiRclosed in 
US Pat Nos. 4,683,195 and 4,683,202, which are incorporated herein by reference. 

The ligase Chain Reaction (LCR) is an alternate method for nucleic acid anqilification. In 
LCR, probe pairs are used which include two primary (first and second) and two secondary (third 
5 and fourth) probes, all of which are employed in molar excess to target The first probe 

hybridizes to a first segment of the target strand, and the second probe hybridizes to a second 
segment of the target strand, the first and second segments being contiguous so that the p rimar y 
probes abut one another in 5’ phosphate-3’ hydroxyl relationship, and so that a hgase can 
covalently fuse or hgate the two probes into a fused product Jn addition, a third (secondary) 

10 probe can hybridize to a portion of the first probe and a fourth (secondary) probe can hybridize 

to a portion of the second probe in a similar abutting fashion. Of course, if the target is initially 
double stranded, the secondary probes also will hybridize to the target complement in the first 
instance. Once the ligated strand of primary probes is separated from the target strand, it will 
hybridize with the third and fourth probes which can be ligated to form a complementary, 

.5 secondary ligated product It is important to realize that the ligated products are functionally 
equivalent to either the target or its complenient By repeated cycles of hybridization and 
ligation, amplification of the target sequence is achieved This technique is described more 
cornpletely in EP-A-320 308 to K. Backman published Jun. 16, 1989 and EP-A-0439182 to EL 
Backmaner of., published Jul. 31, 1991, both ofwhich are incorporated herein by reference. 

0 For amplification of mRNAs, it is within the scope of the present invention to reverse 

transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a 
single enzyme for both steps as described in US Pat No. 5,322,770, which is incorporated herein 
by reference; or reverse transcribe mRNA into cDNA followed by asymmetric gap ligase chain 
reaction (RT-AGLCR) as described by R. L. MarshaU et al., PCR Methods and Applications 
5 4:80-84 (1994), which also is incorporated herein by reference. 

TMA is described in detail in, e.g„ US Patent No. 5,399,491, the disclosure of which is 
incorporated herein by reference in its entirety, hi one example of a typical assay, an isolated 
nucleic acid sample, suspected of containing a SARSV target sequence, is mixed with a buffer 
concentrate containing the buffer, salts, magnesium, nucleotide triphosphates, primers, 

3 dithiothreitol, and spermidine. The reaction is optionally incubated at about 100°C for 
approximately two minutes to denature any secondary structure. After cooling to room 
temperature, reverse transcriptase, RNA polymerase, and RNAse H are added and the mixture is 
incubated for two to four hours at 37°C. The reaction can then be assayed by denaturing the 
product, adding a probe solution, incubating 20 minutes at 60“C, adding a solution to selectively 

5 hydrolyze the unhybridized probe, incubating the reaction six minutes at 60 “C, and measuring 

the remaining chemiluminescence in a luminometer. 
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Generally, TMA includes .tiie following steps; (a) isolating nucleic acid, including RNA, 
from the biological sample of interest suspected of being infected with SARSV; and (b) 
combining into a reaction mixture (i) the isolated nucleic acid, (ii) first and second 
oligonucleotide primers, the first primer having a complexing sequence sufficiently 
5 complementary to the 3' terminal portion of an RNA tar^t sequence, if present (for example the 

(+) strand), to complex therewith, and the second primer having a complexing sequence 
sufficiently complementary to the 3' terminal portion of the target sequence of its complement 
(for example, the (-) strand) to complex therewith, wherein the first oligonucleotide further 
comprises a sequence 5’ to the complexing sequence which includes a promoter, (iii) a reverse 
10 transcriptase or RNA and DNA dependent DNA polymerases, (iv) an enzyme activity which 

selectively degrades the RNA strand of an RNA-DNA complex (such as an RNAse H) and (v) an 
RNA polymerase which recognizes the promoter. 

Ihe components of the reaction mixture may be combined stepwise or at once. The 
reaction mixture is incubated under conditions whereby an oligonucleotide/target sequence is 
15 formed, including DNA priming and nucleic acid synthesizing conditions (including 
ribonucleotide triphosphates and deoxyribonucleotide triphosphates) for a period of time 
sufficient to provide multiple copies of the target sequence. The reaction advantageously takes 
place under conditions suitable for maintaining the stability of reaction components such as the 
component enzymes and without requiring modification or manipulation of reaction conditions 
20 during the course of the amplification reaction. Accordingly, the reaction may take place under 
conditions that are substantially isothermal and include substantially constant ionic strength and 
pH. The reaction conveniently does not require a denaturation step to separate the RNA-DNA 
complex produced by the first DNA extension reaction. 

Suitable DNA polymerases include reverse transcriptases, such as avian myeloblastosis 
25 virus (AMV) reverse transcriptase (available from, e.g., Seikagaku America, hic.) and Moloney 
murine leukemia virus (MMLV) reverse transcriptase (available from, e.g., Bethesda Research . 
Laboratories). 

Promoters or promoter sequences suitable for incorporation in the primers are nucleic acid 
sequences (eitha: naturally occurring, produced synthetically or a product of a restriction digest) 
JO that are specifically recognized by an RNA polymerase that recognizes and binds to that 

sequence and initiates the pnxess of transcription whereby RNA transcripts are produced. The 
sequence may optionally include nucleotide bases extending beyond the actual recognition site 
for the RNA polymerase which may impart added stability or susc^tibility to degradation 
processes or increased transcription efficiency. Examples of useful promoters include those 
*5 which are recognized by certain bacteriophage polymerases such as those fix)m bacteriophage 
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T3, T7 or SP6, or a promote from E. coli. These RNA polymerases are readily available from 
commercial sources, such as New ^gland Biolabs and Epicentre. 

Some of the reverse transcriptases suitable for use in the mediods herein have an RNAse H 
activity, such as AMV reverse transcriptase. It may, however, be preferable to add exogenous 
5 RNAse H, such as E. coli RNAse H, even when AMV reverse transcriptase is used. RNAse H is 
readily available from, e.g., Bethesda Research Laboratories. 

Tlie RNA transcripts produced by these methods may serve as tempi atps to produce 
additional copies of the target sequence through the above-described mechanisms. The system is 
autocatalytic and amplification occurs autocatalytically without the need for repeatedly 
10 modifying or changing reaction conditions such as temperature, pH, ionic strength or the like. 

Detection may be done using a wide variety of methods, including direct sequencing, 
hybridization with sequence-specific oligomers, gel electrophoresis and mass spectrometry, 
these metiiods can use heterogeneous or homogeneous formats, isotopic or nonisotopic labels, as 
well as no labels at aU. 

1 5 Suitable labeling moieties for attachment to primers and/or to probes used in methods of 

the invention include, but are not limited to: 5-FAM (also called 5-carboxyfluorescein; also 
called Spiro(isobenzofuran-l(3H), 9 -(9H)xanthene)-5-carboxylic acid,3’,6'-dihydioxy-3-oxo-6- 
carboxyfluorescein); 5-Hexachloro-Fluorescein ([4,7,2',4’,5',7'-hexachlorD-(3’,6'- 
dipivaloylfluoresceinyl)-6-caiboxylic add ]); 6-Hexachloro-Fluorescein ([4,7,2’,4',.5’,7'- 
10 hexachloro-(3’,6'-dipivaloylfluoresceinyl)-5-carboxylic acid]); 5-Tetrachloro-Fluorescein 
([4,7,2’,7'-tetrachloro-(3',6*-dipivaloylfluoresceinyl)-5- carboxylic acid ]); 6-Tetrachl6ro- 
Fluorescein ([4,7,2',7’-tetrachloro-(3’,6'-dipivaloylfluorescemyl)-6- carboxylic add ]); 
tetramethylrhodamines (TAMRA), including (i) 5-TAMRA (5-carboxytetramethylrhodamine; 
Xanthylium, 9-(2,4-dicarboxyphtaiyl)-3,6- bis(dimethylamino) and (ii) 6-TAMRA (6- 
15 carboxytetramethylrhodamine; Xanthylium, 9-(2,5-dicarboxyphenyl)-3,6- bis(dimethylamino); 
EDANS (5-((2-aminoethyl)amino)naphthalene- 1-sulfonic add) ; 1,5-IAEDANS (5-((((2- 
iodoacetyl)amino)ethyl) amino)naphthalene-l-sulfonic add); DABCYL (4-((4- 
(dimethylamino)phenyl) azo)benzoic aidd); Cy5 (hidodicarbocyanine-5); Cy3 
(Indodicatbocyanine-3); and BODIPY™ FL (4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s- 
10 indacene-3-propionic add). Labelling of probes with both FAM (e.g. at S’) and TAMRA (e.g. at 
3') is preferred. 

Nucleic acids of the invention may be used in solution or may be bound to a solid matrix or 
support e.g. in the format of a DNA array. 

As is readily apparent, design of the assays described herdn are subject to a great deal of 
•5 variation, and many formats are known in the art The above descriptions are mwely provided as 
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guidance and one of skiU in the art can leadUy modify the described protocols, using techniques 
well known in the art. 

One 302nt amplicon of the SARS virus is known as “BNI-1” (SEQ ID NO: 9927). It was 
sequenced at Ae Bernhard Nocht Institute, Hamburg, Germany. In April 2003 Ae BNI-1 
5 sequence was published on Ae WHO website Qa^://mm.yi>ho.int/csr/mn/pTimer^ and m 

Dorstoi et aL, “Identification of a Novel Coronavirus in Patients wiA Severe Acute Respiratory 
Syndrome”, New England Journal of Medicine, published online at http://www.nejm.org. BoA 
referaices are mcoiporated herem by reference in Aeir entirety. Some embodiments of Ae 
mvention do not encompass a nucleic acid consisting of SEQ ID NO: 9927. Some oAer 
10 embodiments of Ae mvention do not encompass a nucleic acid comprising SEQ ID NO: 9927. 

Some embodiments of Ae mvention do not encompass a polypeptide consisting of any one of 
SEQ ID NO®: 9928 to 9959. Some oAer embodiments of Ae invention do not encompass a 
nucleic add comprismg any one of SEQ ID NO®: 9928 to 9959. Some embodiments of Ae 
mvention are not subject to Aese exclusions. 

.5 Immunoas.’sav.'i 

The present mvention utilizes various immunoassay techniques for identifying inAviduals 
exposed to SARSV and/or biological samples containing SARSV antigens or antibo Aes to 
SARSV. 

Immimoassav Formate 

B The SARSV antigens may be employed in virtually any assay format Aat employs a 

known antigen to detect antibo Aes. A common feature of all of Aese assays is Aat Ae antigen is 
contacted wiA biological sample suspected of containing SARSV antibodies under conAtions 
Aat permit Ae antigen to bind to any such antibody present in Ae component Such conAtions 
wifi typically be physiologic temperature, pH and ionic strengA usmg an excess of antigen. The 
5 incubation of Ae antigen wiA Ae specimen is followed by detection of immune complexes 

comprised of Ae antigen. Alternatively, anti-SARSV antiboAes may be employed to detect Ae 
presence of SARSV antigens m a biological sample. Combmation antigen/antibody assays are 
also contemplated; for example, as described for HCV detection in US patent 6,63038. 

Design of Ae immunoassays is dibject to a great deal of variation, and many formats are 
3 known in Ae art. Protocols may, for example, use solid supports, or immunoprecipitation. Most 

assays involve Ae use of labeled antibody or polypeptide; Ae labels may be, for example, 
enzymatic, fluorescait, chemiluminescent, raAoactive, or dye molecules. Assays which amplify 
Ae signals from Ae inunune complex are also known; examples of which are assays which 
utilize biotin and aviAn, and enzyme-labeled and meAated immunoassays, such as ELISA 
) assays. 
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The immunoassay may be, without limitation, in a heterogeneous or in a homogeneous 
format, and of a standard or competitive type. la a heterogeneous format, the polypeptide is 
typically bound to a solid matrix or support to facilitate separation of the sample from the 
polypeptide after incubation. Examples of solid supports that can be used are nitrocellulose (e.g., 
5 in membrane or microdter well form), polyvinyl chloride (e.g., in sheets or microtiter wells), 
polystyrene latex (e.g., in beads or microtiter plates, polyvinylidine fluoride, diay/^tiTed paper, 
nylon membranes, microchips, high or low density biochips, recombinant immunoassays 
(RIBA), microfluidity devices, micromagnetic beads, activated beads, and Protein A beads. For 
example, Dynatech frnmunlon or hnmunlon 2 microtiter plates or 0.25 inch polystyrene beads 
10 (Precision Plastic Ball) can be used in the heterogeneous formaL The solid support containing 
the antigenic polypeptides is typically washed after separating it from the test sample, and prior 
to detection of boimd antibodies. Both standard and competitive formats are known in the art. 

In a homogenous format, the test sample is incubated with the combination of antigens in 
solution. For example, it may be undo: conditions that will precipitate any antigen-antibody 
15 complexes which are formed. Both stand^ and competitive formats for these assays are known 
in.theait 

Ih a standard format, the amount of SARSV antibodies in the antibody-antigen complexes 
is directly monitored. This may be accomplished by det ermining whether labeled anti- 
xenogeneic (e.g., anti-human) antibodies which recognize an epitope on anti-SARSV antibodies 
10 will bind due to complex formation. In a competitive format, the amount of SARSV antibodies 
in the sample is deduced by monitoring the competitive effect on the binding of a known amount 
of labeled antibody (or other competing ligand) in the complex. 

Complexes formed comprising anti-SARSV antibody (or in the case of competitive assays, 
the amount of competing antibody) are detected by any of a number of known techniques, 
t5 depending on the format. For example, unlabeled SARSV antibodies in the complex may be 

detected using a conjugate of antixenogwieic Ig complexed with a label, (e.g., an enzyme label). 

In an immunoprecipitation or agglutination assay format the reaction between the SARSV 
antigens and the antibody forms a networic that precipitates from the solution or suspension and 
forms a visible layer or film of precipitate. If no anti-SARSV antibody is present in the test 
■>0 specimen, no visible precipitate is formed. 

There are at least three specific types of particle agglutination (PA) assays. These assays 
are used for the detection of antibodies to various antigens when coated to a support One type of 
this assay is the hemagglutination assay using red blood cells (RBCs) that are sensitized by 
passively adsorbing antigen (or antibody) to the RBC. The addition of specific antigen antibodies 
5 present in the body component, if any, causes the RBCs coated with the purified antigen to 
agglutinate. 
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To eliminate potmitial non-specific reactions in the hemagglutination assay, two artificial 
carriers may be used instead of RBC in the PA. The most common of these are latex particles. 
However, gelatin particles may also be used. The assays utilizing either of these carriers are 
based on passive agglutination of the particles coated with purified antigens. 

5 The SARSV antigens will t 3 /pically be packaged in the form of a kit for use in these 

immunoassays. The kit will normally contain in separate containers the native SARSV antigen, 
control antibody formulations (positive and/or negative), labeled antibody when the assay format 
requires same and signal generating reagraits (e.g., enzyme substrate) if the label does not 
generate a signal directly. TTie native SARSV antigen may be already bound to a solid matrix or 
10 separate with reagents for binding it to the matrix. Instructions (e.g., written, tape, CD-ROM, 

etc.) for carrying out the assay usually will be included in the kit 

Immunoassays that utilize the native SARSV antigen are additionally useful in screening 
blood for the preparation of a supply fix)m which potentially infective SARSV is lacking. The 
method for the preparation of the blood supply comprises the following steps. Reacting a body 
1 5 component preferably blood or a blood component fi»m the individual donating blood with 
native SARSV antigen to allow an immunological reaction between SARSV antibodies, if any, 
and the SARSV antigen. Detecting whether anti-SARSV antibody-SARSV antigen complexes 
are formed as a result of the reacting. Blood contributed to the blood supply is from donors that 
do not exhibit antibodies to the native SARSV antigens. 

iO Production of Antihndies 

As explained above, the assay may utilize various antibodies which may be bound to a 
solid support and that detect antigen or antigen/antibody complexes formed when SARSV 
infection is piresent in the sample. These antibodies may be polyclonal or monoclonal antibody, 
preparations, monospecific antisera, human antibodies, or may be hybrid or chimeric antibodies, 
15 such as humanized antibodies, altered antibodies, F(ab *)2 ficagments, F(ab) fiagments, Fv 
fragments, single-domain antibodies, dimeric or trimeric antibody fragment constracts, 
minibodies, or functional fragments thereof which bind to the antigen in question. 

Antibodies ate produced using techniques weU known to those of skill in the art and 
disclosed in, for example, US Pat Nos. 4,011^08; 4,722,890; 4,016,043; 3,876,504; 3,770,380; 

■0 and 4,372,745. For example, polyclonal antibodies are generated by imnumizing a suitable 
animal, such as a mouse, rat rabbit sheep or goat with an antigen of interest In order to 
enhance immunogenicity, the antigen can be linked to a carrier prior to immunization. Such 
carriers are weD known to those of ordinary skill in the art Immunization is generally performed 
by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as fteund's 
5 complete adjuvant and injecting the mixture or emulsion parenterally (generally subcutaneously 

or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections 
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Of the antigen in saline, preferably using Freund’s incomplete adjuvant Antibodies may also be 

generated by m vitro immunization, using methods known in the art Polyclonal antiserum is 
then obtained from the immunized animal. 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
5 (1975) Nature 256:495-497 , or a modification thereof, as described above. 

As explained above, antibody firagments which retain the ability to recognize the antigen of 
mterest, will also find use in the subject immunoassays. A number of antibody fragments are 
known m the art which comprise antigen-binding sites capable of exhibiting immunological 
binding properties of an intact antibody molecule. For example, functional antibody fragments 
.0 can be produced by cleaving a constant region, not responsible for antigen binding, from the 
antibody molecule, using e.g., pepsin, to produce F(ah'h fragments. These fragments will contain 
two antigen binding sites, but lack a portion of the constant region from each of the heavy 
chams. Similarly, if desired. Fab fragments, comprising a single antigen binding site, can be 
produced, e.g., by digestion of polyclonal or monoclonal antibodies with papain. Functional 
5 fragments, including only the variable regions of the heavy and light chains, can also be 

produced, using standard techniques such as recombinant production or preferential proteolytic 
cleavage of rmmunoglobulin molecules. These fragments are knovm as Fv. See, e.g.; Inbar et al. 
(1972) Proc. Nat Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) Biochem 15:2706- 
2710; and Ehrlich etal. (1980) Biochem 19:4091-4096. 

3 A smgle-chain Fv ("sFv" or "scPv”) polypeptide is a covalently linked Vh-Vl heterodimer 

which is expressed from a gene fusion including Vh- and VL-encoding genes linked by a peptide: 
encoding linker. Huston er aL (1988) Proc. Nat. Acad. Sci. USA 85:5879-5883. A number of 
methods have been described to discern and develop chemical structures (tinkers) for converting 

the naturally aggregated, but chemicaUy separated, tight and heavy polypeptide chains from an 

! antibody V region into an sFv molecule which will fold into a three dimensional stracture 

substantially sunilar to the stracture of an antigen-binding site. See, e.g., US PaL Nos. 5,091,513, 

5.132.405 and 4,946,778. The sFv molecules may be produced using methods described in the 
art See, e.g., Huston et al. (1988) Proc. NaL Acad. Sci. USA 85:5879-5883; US Pat. Nos. 

5.091.513. 5.132.405 and 4,946,778. Design criteria include determining the appropriate length 
to span the distance between the C-terminus of one chain and the N-terminus of the other, 
wherein the tinker is generally formed from small hydrophilic amino acid residues that (to not 
tend to coil or form secondary structures. Such methods have been described in the art See, e.g., 
US Pk. Nos. 5.091,513, 5,132,405 and 4,946,778. Suitable tinkers generally comprise 
polypeptide chains of alternating sets of glycine and serine residues, and may include glutamic 
acid and lysine residues inserted to enhance solubility. 
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"^Dni-aIltibodies" or "minibodies" will also find use with the present invention, 
hfinibodies are sFv polypeptide chains which include oligomeri 2 ation Hnmainc at their C- 
tennini, separated from the sFv by a hinge region. Pack et aL (1992) Biochem 31:1579-1584. 
The oligomerization domain comprises self-associating a-helices, e.g., leucine zippers, that can 
5 be further stabilized by additional disulfide bonds. The oligomerization domain is designed to be 
compatible with vectorial folding across a membrane, a process thought to facilitate in vivo 
folding of the polypeptide into a functional binding protein. Generally, minibodies are produced 
using recombinant methods well known in the art See, e.g.. Pack et oL (1992) Biochem 
31:1579-1584; Cumber et aL (1992) J. Lnmunology 149B: 120-126. 

10 Production of SARS Antigens 

The SARSV antigens used in the present invention are generally produced recombinantly. • 
Thus, polynucieotides encoding SARSV antigens for use with Ae present invention can be maHo . 
using standard techniques of molecular biology. For example, polynucleotide sequences coding ■ 
for Ae above-described molecules can be obtained using recombinant meAods, such as by 
.5 ^‘^’^^S^^I^NAandgenomiclihranesfiomcellsexpressing Ae gene, or by deriving Ae gene 

from a vector known to include Ae same. Furthermore, Ae desired gene can be isolated directly 
from viral nucleic acid molecules, usmg techniques described in Ae art, such as Aose described 
for HCV in Hougjiton et al., US Pat. No. 5,350,671. The gene encoding Ae antigen of mterest 
can also be produced syn AeticaUy, raAer Aan cloned. The molecules can be designed wiA 

0 appropriate codons for Ae particular sequence (preferably optimum codons for Ae expression 

host of choice). The complete sequence is Aen assembled from overlapping oUgonucleotides 
prepared by standard meAods and assembled mto a complete co ding sequence. See, e.g.. Edge 
il9Sl) Nature 292:756; Nambair et al. (1984) Science 223:1299; and Jay et aL (1984)7. Biol. 
Chem. 259:6311. 

5 Thus, particular nucleotide sequences can be obtained firom vectors harboring Ae desired 

sequences or synAesized completely or in part using various oligonucleotide synAesis 
techniques known m Ae art, such as site-directed mutagenesis and polymerase chain reaction 
(PCR) techmques where appropriate. See, e.g., Sambrook, supra. In particular, one meAod of 
obtaining nucleotide sequences encoding Ae desired ^uences is by annealing complementary 
) sets of overlapping synAetic oligonucleotides produced in a conventional, automated 

polynucleotide synAesizer, foUowed by ligation wiA an ^propriate DNA Hgase and 
amplification of Ae ligated nucleotide sequence via PCR. See, e.g., Jayaraman et al. (1991) 

Proc. Natl Acad. Sci. USA 88:4084-4088. Additionally, oligonucleotide directed synAesis 
(Jones et oL (1986) Nature 54:75-82), oligonucleotide directed mutagenesis of pre-existing 
) nucleotide regions (Riechmann et oL (1988) Nature 332:323-327 and VeAoeyen et al (1988) 
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Science 239:1534-1536), and enzymatic fiUing-in of gapped oKgonucleotides using T4 DNA 
polymerase (QaeeaetaL (1989) Proc. NatL Acad. ScL USA 86:10029-10033) can be used under 
the invention to provide molecules having altered or enhanced antigen-binding capabilities, 
and/or reduced imniunogenicity. 

5 Once coding sequences have been prepared or isolated, such sequences can be cloned into 

any suitable vector or replicon. Numerous cloning vectors are known to those of skiU in the art, 
and the selection of an appropriate cloning vector is a matter of choice. Suitable vectors include, 
but are not limited to, plasmids, phages, transposons, cosmids, chromosomes (including artificial 
chromosomes, such as B ACs or YACs) or viruses which are capable of replication when 

.0 associated with the proper control elements. 

The coding sequence is then placed under the control of suitable control elements, 
depending on the system to be used for expression. Thus, the coding sequence can be placed 
under the control of a promoter, ribosome binding site (for bacterial expression) and, optionally, 
an operator, so that the DNA sequence of interest is tran^bed into RNA by a suitable 
5 transformant. The coding sequence may or may not contain a signal peptide or leader sequence 

which can la^ be removed by the host in post-translational processing. See, e.g. , US PaL Nos. 
4,431,739; 4,425,437; 4,338,397. 

In addition to control sequences, it may be desirable to add regulatory sequences which ’ . 

allow for regulation of the expression of the sequences relative to the growth of the host cell. 

0 Regulatory sequences are known to those of skill in the art, and examples include those which 
cause the expression of a gene to be turned on or off in response to a chemical or physical 

stimulus, including the presence of a regulatory compound. Other types of regulatory elements 
may also be present in the vector. For exan^le, enhancer elements may be used herein to 
increase expression levels of the constructs. Examples include the SV40 early gene enhancer 
) (Dijkema et aL (1985) EMBO J. 4:761), the enhancei7promoter derived from the long terminal 
repeat (LTR) of the Rous Sarcoma Virus (Gorman et aL (1982) Proc. Natl. Acad. Sci. USA 
79:6777) and elements derived from human CMV (Boshart et al. (1985) Cell 41:521), such as 
elements included in the CMV intron A sequence (US PaL No. 5,688,688). The expression 
cassette may further mclude an origin of replication for autonomous replication in a suitable host 
' cell, one or more selectable maricers, one or more restriction sites, a potential for high copy 
number and a strong promoter. 

An expression vector is constructed so that the particular coding sequence is located in the 
vector with the appropriate regulatory sequences, the positioning and orientation of the coding 
sequence wrth respect to the control sequences being such that the coding sequence is transcribed 
under the "control" of the control sequences (te., RNA polymerase which binds to the DNA 
molecule at the control sequences transcribes the coding sequence). Modification of the 
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sequences encoding the molecule of interest may be desirable to achieve this end. For example, 
in some cases it may be necessary to modify the sequence so that it can be attached to the control 
sequences in tiie appropriate orientation; ie., to maintain the reading frame. The control 
sequences and other regulatory sequences may be ligated to the coding sequence prior to 
5 insertion into a vector. Alternatively, the coding sequence can be cloned directly into an 

expression vector which already contains the control sequences and an appropriate restriction 
site. 

As explained above, it may also be desirable to produce mutants or analogs of the antigen 
of interest Methods for doing so are described in, e.g„ Dasmahapatra et al„ US Pat No: 

10 5,843,752 and Zhang et al., US Pat No. 5,990,276. Mutants or analogs of SARSV proteins for 

use in the subject assays may be prepared by the deletion of a portion of the sequence encoding 
the polypeptide of interest by insertion of a sequence, and/or by substitution of one or mote 
nucleotides within the sequence. Techniques for modifying nucleotide sequences, such as site- 
directed mutagenesis, and the like, are weD known to those skilled in the art. See, e.g., Sambrook 
.5 et al, supra-, Kunkel, T. A. (1985) Proc. Natl Acad. ScL USA (1985) 82:448; Geisselsoder etal : 
(1987) BioTechniques 5:786; ZoUer & Smith (1983) Methods Enzymol 100:468; Dalbre- 
McFarland et al. (1982) Proc. Natl Acad Sci USA 79:6409. 

The molecules can be expressed in a wide variety of systems, including insect 
manunalian, bacterial, vital and yeast expression systems, all well known in the art. 

0 For example, insect cell expression systems, such as baculovirus systems, are known to 

those of skill in the art and described in, e.g.. Summers & Smith, Texas Agricultural Expenment ■ 
Station Bulletin No. 1555 (1987). Materials and methods for baculovirus/insect cell expression 
systems are commercially available in kit form from, inter alia, hivitrogen, San Diego Calif. 

( MaxBac kit). Similarly, bacterial and ma mmalian cell expression systems are well known in 
5 the art and described in, e.g. , Sambrook et al, supra. Yeast expression systems are also known in . 

the art and described in, e.g.. Yeast Genetic Engineering (Barr et al., eds., 1989) Butterworths, 
London. 

A number of appropriate host ceUs for use with the above systems are also known. For 
example, m a mm a li an cell lines are known in the art and include cell lines available 

3 from the American Type Culture Collection (ATCQ, such as, but not limited to, Chinese 

hamsto- ovary (CHO) cells, HeLa ceUs, baby hamster kidney (BHK) cells, monkey kidney cells 
(COS), human embryonic kidney ceUs, human hepatocellular carcinoma cells {e.g, Hq) G2), 
Madin-Darby bovine kidney ("MOBK") ceUs, as well as others. Similarly, bacterial hosts such as 
Ecoli, Bacillus subtilis, and Streptococcus spp., will find use with the present expression 
) constracts. Yeast hosts useful in the present invention include inter alia, Saccharomyces 

cerevisiae, Candida albicans, Candida maltosa, Hansenula polymorpha, Kluyveromyces fragilis, 

-199- 




wo 2004/092360 



PCT/DS2004/011710 



. Kluyveromyces lactis, Pichia guUlermiondii, Pichia pastoris. Schizosaccharomyces pombe and 
Yarrowia lipolytica. Lisect cells for use with baculovirus expression vectors include, inter alia, 
Aedes aegypti, Autographa califormica, Bombyx mori. Drosophila meUmogaster, Spodoptera 
Jrugiperda, and Trichoplusia ni. 

5 Nucleic acid molecules comprising nucleotide sequaices of intoest can be stably 

integrated into a host ceU genome or maintained on a stable episomal element in a suitable host 
cell using various gene delivery techniques well known in the art. See, e.g., US Pat. No. 
5399346. 

Depending on the expression system and host selected, the molecules are produced by 
10 growing host ceUs transformed by an expression vector described above under conditions 

whereby the protein is expressed. The expressed protein is then isolated fiom the host ceUs and 
purified. If the expression system secretes the protein into growth media, the product can be 
purified directly fiom the media. If it is not secreted, it can be isolated fiom cell lysates. The 

selection of the appropriate growth conditions and recovery methods are within the skill of the 
5 art 



EXAMPLE 

For useful expression of SARSV antigens in Saccharomyces cerevisiae and Pichia 
pastoris, insect cells, and m a mm alian cells, the following domains are cloned into expression 

vectors as listed in the Table below. The nt sequence numbers are fiom the SARSV sequence of 
0 SEQIDNO:!. 

- RNA polymerase la: SARS nt 250-13398 
RNA polymerase lb: SARS nt 13399-21470 

- ORRis.aivelope (homologous to ns2, hemagglutinin-esterase envelope glycoprotein, 
and spike glycoprotein): SARS nt 21477-25244 

5 - Membrane: SARS nt 27849 - 28103 



Nucleocapsid: SARS nt 28105 - 29373 



A combination of PCR and synthetic oligos is used to create the above domains with restriction 
sites tailored to the following expression vectors: 



Restriction ends 
HindnVSall 
EcoRJJSam 
Xbal/San. 

EcoRl/BamHl 

EcdRVXnud 

NheVSaa 



Vector 

pBS24.1 

pBS24.1 

pA0815 

pCMVkm2 

pCMVin 

pBluBac4.S 



Promoter 
ADH2/GAPDH 
ADH2/GAPDH/SOD fusion 
AOXl 

CMVp/Enhancei/IhtronA 

CMVp/Enhanca/IntronA 

Polyhedrin 



Expression host 

AD3/Saccharomyces 

AD3/Saccharomyces 

GS 1 IS/Pichia pastoris 

HVK-293/Transient 

transfection 

CHO stable cell line 

Cell lines enq>loyed by Chinxi 

include: Sf9, Sf21, Tn5 
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IV. TREATMENT OF SARS INFECTION WITH RNAi 

RNA int^iference or "RNAi" is a term initially coined by Fire and co-workers to describe 
the observation that double-stranded RNA (dsRNA) can block gene expression when it is 
introduced into worms (Fire et al.. Nature 391, 806-811(1998)). RNAi most likely involves 
5 mRNA degradation, resulting in sequence-specific, post-trariscriptional gene silencing in many 

organisms. RNAi is a post-transcriptional process triggered by the introduction of double- 
stranded RNA which leads to gene silencing in a sequence-specific manner. RNAi has been 
reported to occur naturally in organisms as diverse as nematodes, trypanosmes, plants and fimgi. 
It most likely serves to protect organisms from viruses, modulate transposon activity and 
10 eliminate aberrant transcription products. 

The first evidence that dsRNA could achieve efficient gene silencing through RNAi r.amp. 
from studies on the nematode Caenorhabditis efegons (Fire et aL (1998) Nature, 391:806-811 
and US Patent No. 6,506,559). hater studies in the finit fly Drosophila melanogaster 
demonstrated that RNAi is a two-step mechanism (Elbashir et al. (2001) Genes Dev., 15(2): 188- 
15 200). Hrst, long dsRNAs are cleaved by an enzyme known as Dicer in 21-23 nucleotides (nt) 

fragments, caUed small interfering RNAs (siRNAs). Then, siRNAs associate with a ribonuclease 
complex (termed RISC for RNA Induced Silencing Complex) which target this complex to 
com^plementary mRNAs. RISC then cleaves the targeted mRNAs opposite the complementary 
siRNA, which makes the mRNA susceptible to other RNA degradation pathways. 

20 RNAi is the phenomenon where dsRNA corresponding to a targeted DNA or RNA 

sequence can suppress or silence gene expression. Even though dsRNA can mpfliatR gene- 
specific interference in mammalian cells in some circumstances (W ^ann y & Zemicka-Goetz 
(2000) Nature Cell Biol. 2:70-75; Svoboda et cd. (2000) Development 17:4147-4156) the use of 
RNAi in mammalian somatic ceDs is often limited due to the dsRNA triggering dsRNA- 
25 dependent protein kinase (PKR) which in turn inactivates translation factor eIF2a and causes a 
generalized suppression of protein synthesis and often times ^optosis (Gil & Esteban (2000) 
Apoptosis 5:\0n -114^. 

Recently, gene-specific suppression using siRNA of approximately 21 or 22 base pairs in 
length, corresponding to targeted RNA or DNA sequences, were shown to disrupt the expression 
30 of these targeted sequences in mammalian cells (Elbashir, SM., et al.. Nature 411: 494-498 

(2001)). However, it is not clear that all RNA or DNA sequences of a mammali an cell’s genome 
are susceptible to siRNA. It is also uncertain that every mammali an cell type possesses the 
necessary machinery for effecting gene-specific suppression using siRNA. Further, siRNA is of 
limited use for at least two reasons: the transient nature of the suppression effect seen in cefis 
35 where the siRNA has been administered; and in some instances the necessity for chemical 

synthesis of siRNAs before their use CTuschl T., Nature BiotechnoL, 20: 446-448 (2002)). Also 
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the instability of these short, s^thetic RNAs makes it presents problems for any long term use of 
these siEtNAs a pharmaceutical. 

To overcome this limitation, the present invention provides a TnnftjfiftH siRNA with 
increased stability against nuclease degradation while still maintainin g its ability to inhibit viral 
5 replication via RNA interferraice. Such modification to the ribonucleotides in the siRNAs, adds 

a chemical group via chemical synthesis or in vitro transcription or longer modified RNAs can 
be prepared by either of these methods and cut into siRNAs using Dicer. 

Although other methods for gene-specific suppression have utilized chemically-modified 
nucleic acids, such as antisense and ribozyme technology, such modification destroys critical 
10 raizymatic activities necessary for the fimction of these technologies. In regard to antisense 
technology, modification of the ribonucleotides destroys RNaseH activity, whereas such 
. modification abolishes the catalytic activity of ribozymes. 

'Ihe present invention provides a double-stranded RNA (dsRNA) molecule modified for 
protection against nuclease degradation with a length fiom about 10 to about 30 nucleotides 
15 which is able to inactivate a virus in a mammalian cell. The invention also provides a method of 
inactivating a virus by administering modified small interfering RNAs (siRNAs) that are 
modified so that they are nuclease or RNase resistant and retain the biological activity of being 
able to inhibit viral replication by targeting a RNA sequence in a virus. 

, The invention is further directed to a method of making modified siRNAs that target a 
10 RNA sequence in a virus comprising preparing a modified-double stranded RNA (dsRNA) 
fragment containing at least one modified ribonucleotide in at least one strand that spans the 
genome of the virus; and cleaving the modified-dsRNA fragments with recombinant human 
Dicer resulting in more than one modified siEiNA. 

The present invention provides a modified dsElNA molecule of ftom about 10 to about 30 
15 nucleotides which mediates targeted RNA interference in hepatic or S ARS-infected cells. 

As used herein RNA interference, or RNAi, is used to mean sequence-specific, or gene 
specific, suppression of gene expression (protein synthesis), without causing a generalized 
suppression of protein synthesis in cells harboring the siRNA. Die invention is not limited to a 
particular theory of the mechanism of action of RNAi. For example, RNAi may involve 
10 degradation of messenger RNA (mRNA) in an RNA-induced silencing complex (RISC), 

preventing translation of the transcribed mRNA, or it may involve the methylation of genomic 
DNA, shunting transcription of the gene. The lack of gene expression caused by RNAi may be 
transient, lasting a short period of time, or it may be stable, or permanent, lasting an indefinite 
period of time. 

i5 The term RNA is meant as is recognized in the art Further, as used herein, RNA is used to 

mean double-stranded RNA (dsRNA) or sin^e-stranded RNA (ssRNA) or a dsRNA with a 
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single-stranded oveihang. dsRNAs within Ae meaning of Ae present mvention mcludes short 
mterfering RNA (siRNA), micro RNA (miRNA) and small haiipm RNA (shRNA), 

AdAtionally, RNA is also used to mean messenger RNA (mRNA), transfer RNA (tRNA) or 
libosomal RNA (rRNA). 

5 Hie present mvention is directed to smaU mterfering RNA (siRNA) which have been 

chemically modified to confe increased stabAty against nuclease degradation yet Aese siRNAs 
are still able to bmd to target RNAs, that may be present m a cells. Li Ae case where Ae target 
RNA is a virus specific RNA, Ae modified siRNAs are able to bmd to Ae virus specific RNAs 
and mactivate Ae virus. A modified siRNA of Ae present mvention conyirises a modified 
10 ribonucleotide, wherem Ae sURNA is resistant to enzymatic degradation, such as RNase 

degradation, and yet retains Ae abAty to inhibit viral replication. The modified siRNA is more 
specifically modified at Ae 2' position of Ae ribose m Ae siRNA. The modification is at Ae 2' 
position of at least one ribonucleotide of said siRNA. Attachment of receptor-bm Ang bgands to : 
siRNA molecAes can be used to tar^ Ae siRNA to a desired cell type. For example, 

15 attachment. of cholesterol at Ae 5'-end or 3'-aid of Ae siRNA molecAe, to give a cholesteryl 

siRNA, can enhance targetmg to hepatocytes. OAer ligands for receptor mediated siRNA 
targetmg to liver mclude HBV surface antigen, LDL, and oAers. 

More specifically, Ae siRNA is moAfied at at least one pyrimiAne, at least one purine or a 
combination Aereof. However, generally all pyrimidmes, or aU purines or a combination of all 
iO pyiimiAnes and all purines of Ae siRNA are modified. More preferably, Ae pyrimidines are • 

modified and Aese pyrimidines are cytosme, a derivative of cytosine, uracil, a derivative of 
uracil or a combmadon Aereof. It also is conten^lated to modify Ae selected ribonucleotides m- 
at least one strand of Ae siRNA or Ae ribonucleotides m bo A strands of Ae siRNA are 
modified. 

15 The nucleotides containing pyrimidine bases found m RNA (cytidine and uridine) can be 

chemically modified by adding any molecule Aat inhibits RNA degradation or breakdown to Ae 
2' position of Ae ribose molecule. The 2'-modified p yrimidine nucleotide can be formed usmg a 
number of Afferent meAods. The 2' modification confers mcieased stability to Ae siRNA by ■ 
m a k i n g Ae siRNA imprarious or resistant to nuclease activity. Hius, Ae 2' moAfied siRNA has 
$0 a longCT serum half-life and is resistant to degradation compared to unmodified siRNA. The 
siRNA also may be modified completely or partiaUy. 

RegarAng chemical modification of siRNAs, a molecule from Ae halide chemical group is 
preferably added to Ae ribonucleotide of Ae siRNA. Within Ae halides, fluorine is Ae preferred 
molecule but oAer chemical molecules, in adAtion to fluoro-, such as meAyl-, meAoxyeAyl- 
•5 and propyl-moAfications can also we made. But Ae prefixed moAcations is fluoro- 

moAfication, such as a 2'-£luoro-moAcation or a 2',2'-fluoro-modification.llius, in a prefrared 
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embodiment of the invention, the siRNA is modified by adding a fluorine molecule to the 2' 
carbon of the pyrimidine ribonucleotide. The siRNA may be fluorinated completely or partially. 
For example, only die cytosine nucleotides need be fluorinated. Alternatively, only the uracil 
nucleotide need be fluorinated but both uracil and cytosine can be fluorinated. Furthermore, 

5 only one strand, dther sense or antisense, of the siRNA can be fluorinated. Even partial 2' 
fiuorination the siRNA gives protection against nucleolytic degradation. Furthermore, it is 
important to note the 2' fluorinated siRNA is not toxic to cells, an unexpected result given that 
fluorine chemistry usually is toxic to living organisms. 

The siRNA of the present invention is designed to interact with a target nucleotide 
10 sequence. Most preferably this target nucleotide sequence is a disease producing agent or 

pathogen of which one wishes to inhibit gene expression. More preferably, this target nucleotide 
sequence is in a virus genome, and further this virus genome is from a RNA virus or a DNA 
virus is selected from the group consisting of hepatitis C virus OHCV), hepatitis A virus, hepatitis 
B virus, h^atitis D virus, hepatitis E virus, Ebola virus, influenza virus, rotavirus, reovirus, 

15 retrovirus, poliovirus, human papilloma virus (HPV), metapneumovirus and.coronavirus. The 
most preferred virus is SARS virus. 

Modfied siRNA may be prepared in a number of ways, such as by chemical synthesis, T7 
polymerase tr^scription, or by treating modified long double stranded RNA (dsRNA) prepared 
by one of the two previous methods with Dicer enzyme. Dicer enzyme can be used to cleave 
iO dsRNA that is about 500 base pairs to about 1000 base pairs in size, to created mixed ' 
populations of dsRNA from about 21 to about 23 base pairs in length. Furthermore, an 
unexpected result of using the Dicer enzyme method is that Dic» enzyme will cleave modified 
strands of dsRNA, such as 2' fluorinated modified dsRNA. Before development of this method, 
it was previously thought that Dicer would not be able to cleave modified siRNA. The Dicer 
15 method can be carried out using the Dicer siRNA Generation Kit available from Gene Therqry 
Systems, San Diego, CA. 

As used herein, small interfering RNA (siRNA) is defined as double- or single-stranded 
RNA of from about 10 to about 30 nucleotides in length, more preferably 12-28 nucleotides, 
more preferably 15-25 nucleotides, even more preferably 19-23 nucleotide^ and most preferably 
SO 21-23 nucleotides. The length of a siRNA as used herein, is dete rmine d by the length of one of 
the strands of the RNA. For example, a siRNA that is described as 21 nucleotides long (a 21- 
mer) may comprise two opposite strands of RNA which anneal together for 19 contiguous base 
pairings. The two remaining nucleotides on one end of the molecule would not aimeal to the 
opposite strand, thus creating an “overhang”. The overhang can be at the 5' or the 3' end of the 
>5 dsRNA. Preferably, the oveAang is at the 3' end of the RNA strand. The length of a double- 
stranded RNA where the two opposite strands are not the same length will be designated by the 
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longer of the two strands. For example, a dsRNA comprising one strand which is 21 nucleotides 
long and anneals to an opposite strand that is 20 nucleotides long, will be considered, as used 
herein, a 21-mer. 

Preferably, the siRNA of the present invention will comprise a 3' overhang of about 2 to 4 
5 bases. More preferably, the 3* overhang is 2 nucleotides long. Even more preferably, the 2 
nucleotides comprising the 3* ov^ang are uridine (U). 

In one embodiment, the invention provides an RNA molecule comprising a nucleotide 
sequence at least 80% identical to the nucleotide sequence of the target agent or virus. 

Preferably, the RNA molecule of the present invention is at least 90%, 95%, 96%, 97%, 98%, 

10 99% or 100% identical to the nucleotide sequence of the target agent or virus. 

As a practical matter, whedier any particular nucleic acid molecule is at least 90%, 95%, 
96%, 97% 98%, 99% or 100% identical to the nucleotide sequence of the target agent or virus 
can be determined conventionally using known computer programs such as the Bestfit program 
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, ‘ 

15 University Research Park, 575 Science Drive, Madison, Wis. 53711). Bestfit uses the local 
homology algorithm of Smith & Waterman {Advances in Applied Mathematics 2:482-489 
(1981)) to find the best segment of homology between two sequences. When using Bestfit or any 
other sequence alignment prograni to determine whether a particular sequence is, for instance, ' 
95% identical to a reference sequence according to the present invention, the parameters are set, ; 
iO of course, such that the percentage of identity is calculated over the full length of the reference 

nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in, 
the reference sequence are allowed. 

The present invention provides a method of inactivating a target agent or preferably a virus 
in a. patient comprising administering to the patient a modified siRNA in an effective amount to 
t5 inactivate the targeted agent or virus. RNA interference towards a targeted DNA segment in a 
cell can be achieved by administering a dsRNA molecule or siRNA to the cells, wherein the 
nucleotide sequence of the dsRNA molecule corresponds to the nucleotide sequence of the 
targeted DNA segment. Preferably, the RNA molecule used to induce targeted RNAi is siRNA. 

Goie suppression, targeted suppression, sequence-specific suppression, targeted RNAi or 
•0 sequence-specific RNAi are used interchangeably herein. Furthermore, sequence-specific 
suppression, as used herein, is determined by separately assaying the levels of the protein 
targeted for suppression in cells containing the siRNA (experimental cells) and in cells not 
containing the identical siRNA (control cells), and comparing the two values. Furthermore, the 
experimental and control cells must be derived from the same source and gamp, animal For 
5 example, the control and experimental cells can be, but are not limited to, normal human hepatic 
cells as cell culture in vitro, or they can derived from a hepatocellular carcinoma. Further, the 
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control and expoimental cells used in determining the level or quantity of gene suppression must 
be assayed under similar, if not identical, conditions. 

As used herein the phrase “targeted DNA segment” is used to mean a DNA sequence 
encoding, in whole or in part, an mRNA for a targeted protein, including introns or exons, where 
5 suppression is desired. DNA segment can also mean a DNA sequence that normally regulates 
expression of the targeted protein, including but not limited to the promoter of tihe targeted 
protein. Furthermore, the DNA segment may or may not be a part of the cell’s genome or it may 
be extrachromosomal, such as plasmid DNA. 

'Ihe present invention is further directed to inactivating a virus in a patient comprising 
10 administering to a patient a modified siRNA in an effective amount to inactivate the virus. The 
siRNA is preferably about 10 to about 30 nucleotides in length, more preferably 12-28 
nucleotides, mote preferably 15-25 nucleotides, even mote preferably 19-23 nucleotides and 
most preferably 21-23 nucleotides. The method prefraably utilizes a 2' mndifipd siRNA that is 
modified at the 2' position of at least one ribonucleotide of said siRNA. The method utilizes a 
15 SiRNA that is modified with chemic^ groups selected from the group consisting of fluoro-, 

methyl-, methoxyediyl- and propyl-modification. The fluoro-modification is preferred and either 
a 2'-fluoro-modication or a 2',2'-fIuQro-modification is useful in the present invention and 
preferred. 

The modification may be at the pyrimidines, the purines or a combination thereof of the 
10 siRNA are modified. More preferably the pyrimidines are modified, such as cytosine, a 
derivative of cytosine, uracil, a derivative of uracil or a combination thereof. In one ■ 
embodiment, at least one strand of the siRNA contains at least one modified nucleotide and in an 
alternate embbdiment, oth strands of the siRNA contains at least one modified nucleotide. 

The method is intended to target disease causing agents or pathogens, an more particularly 
15 viruses, which can be either a RNA virus or a DNA virus, which are selected fiom the group 
consisting of hepatitis C virus (HCV), hepatitis A virus, hepatitis B virus, hepatitis D virus, 
hepatitis E virus, Ebola virus, influenza virus, rotavirus, reovirus, retrovkus, poliovirus, human 
papilloma virus (HPV), metapnemnovirus and coronavirus. More preferably the target virus is a 
SARS virus. Ihe present method utilizes a siRNA prepared by (a) identifying a target nucleotide 
>0 sequence in a virus genome, preferably SARS virus, for designing a small interfering RNA 

(siRNA); and (b) producing a siRNA that has been modified to contain at least one rnnHifipH 
nucleotide. More preferably, the siRNA comprises a dsRNA molecule with a first strand 
ribonucleotide sequence corresponding to a nucleotide sequence corresponding to a target 
nucleotide sequence in said virus and a second strand com prising a ribonucleotide sequence 
5 complementary to said target nucleotide sequence, wherein said first and second strands are 

separate complementary strands that hybridize to each other to form said dsRNA molecule, and 
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ftnther whereto the firet strand ribonucleotide sequence, the second strand ribonucleotide 
sequence orboththeflistandsecond strand ribonucletide sequences comprise a least one 
modified nucleotide. In this method, the tacget nucleotide sequence comprises a conserved 
nucleotide sequence necessary for SARS wrus repUcation, and the conserved nucleotide 
3 sequence is selected ftom the group consisttog of SEQ ID NO: 7292, SEQ ID NO: 7293 SEQ ID 

NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298 SEQ 
ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. Preferably, the nucleotide sequmice is 
selected ftom the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. Still mote 
preferably, the nucleotide sequence is SEQ ID NO: 7293. 

'-0 The siRNA disclosed in this appUcation may be prepared with modified ribonucleotides as 

described herein. Further, the modified ribonucleotide of the siRNA used in the present method 
is incorporated into said siRNA by chemical synthesis or enzymatic synthesis. 

The SiRNA disclosed in this application may or may not have a 5’ triphosphate group. 

The modified siRNA is administered to a patient by a method selected fiom the group 
5 consiaring of intravenous injection, subcutaneous injection, oral deUvery, and Bposome delivery, 
nie modified SiRNA accumulates to an organ, tissue or body system of the patient that are the 
liver, gastrointestinal tract, respiratory tract, cervix or skin. 

The present invention also provides a method of inhibiting the replication of a virus, such 
as SARS vrrus, m cells positiye for SARS virus comprising transfecting SARS-positive ceUs 
3 wrth a vector that directs the expression of modified siRNA which is specific for SARS. The 

cells are evaluated to determine if a marker in the ceDs has been inhibited by the modified 
SiRNA. 

ITie term patient, as used herern, can be an animal, preferably a mammal. More preferably 

the subject can be a primate, including non-human and humans. The terms subject and patient 
► can be used interchangeably. 

The treatment envisioned by the current invention can be used for subjects with a pre- 
exrstmg viral infection, or for subjects pre-disposed to a SARS virus infection. AdditionaDy, the 

method of the current invention can be used to correct or compensate for ceUular or 

physiological abnormalities involved in conferring susceptibility to viral infections in patients, 

and/or to alleviate symptoms of a viral infection in patients, or as a preventative measure in 
patients. 

The method of treating a patient having a viral infection involves administration of 
compositions to the subjects. As.used herein, composition can mean a pure compound, agent or 
substance or a mixture of two or more compounds, agents or substances. As used herein, the 
term agent, substanee or compound is intended to mean a protein, nucleic acid, carbohyrtate, 
lipid, polymer or a small molecule, such as a drug. 
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In one embodiment of the current invoition, the composition administered to the subject is 
a pharmaceutical composition. Further, the pharmaceutical composition can be administered 
oraUy, nasaUy, parenteraUy, intrasystemically, intraperitoneaUy, topically (as by drops or 
transdermal patch), bucally, or as an oral or nasal spray. Hie term "parenteral," as used herein, 

5 refers to modes of administration which include intravenous, intramuscular, intraperitoneal, 
intrastemal, subcutaneous and intraarticular injection and infusion. The pharmaceutical 
compositions as contemplated by the current invention may also include a pharmaceutically 
acceptable carrier. 

By "pharmaceutically acceptable carrier" is intended, but not limited to, a non-toxic solid, 
10 semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type, 
such as liposomes. 

A pharmaceutical composition of the present invention for parenteral injection can 
comprise pharmaceutically acceptable sterile aqueous or nonaqueous solutions, dispersions, 
suspensions or emulsions as well as sterile powders for reconstitution into st^e injectable 
15 solutions or dispersions just prior to use.. Examples of suitable aqueous and nonaqueous carriers, 
diluents, solvents or vehicles include water, ethanol, polyols (such as glycerol, propylene glycoh 
polyethylene glycol, and the like), carboxymethylcellulose and suitable mixtures thereof, 
vegetable oils (such as olive oO), and injectable organic esters such as ethyl oleate. Proper 
fluidity can be maintained, for example, by the use of coating materials such as lecithin, by the • 
iO maintenance of the required particle size in the case of dispersions, and by the use of surfactants. 

The compositions of the present invention can also contain adjuvants such as, but not 
limited to, preservatives, wetting agents, emulsifying agents, and dispersing agents. Prevention 
of the action of microorganisms can be ensured by (he inclusion of various antibacterial and 
antifungal agents, for example, paraben, chlorobutanol, phenol, sorbic add, and the lil®. It can 
15 also be desirable to include isotonic agents such as sugars, sodium chloride, and the like. 
Prolonged absorption of the injectable pharmaceutical form can be brought about by the 
inclusion of agents which delay absorption such as aluminum monostearate and gelatin. 

In some cases, to prolong the effect of the drags, it is desirable to slow the absorption from 
subcutaneous or intramuscular injection. This can be accomplished by the use of a liquid 
>0 suspension of crystalline or amorphous material with poor water solubility. The rate of 

absorption of the drag then depends upon its rate of dissolution which, in turn, can depend upon 
crystal size and crystalline form. Altmnatively, delayed absorption of a parenteraUy 
administered drag form is accomplished by dissolving or suspending the drug in an oil vehicle. 

Injectable depot forms are made by forming microencapsule matrices of the drag in 
•5 biodegradable polymers such as polylactide-polyglycolide. Depending upon the ratio of drag to 
polymer and the nature of die particular polymer employed, the rate of drag release can be 
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controlled. Examples of otha: biodegradable polymers include poly(orthoesters) and 
poly(anhydrides). Depot injectable formulations are also prepared by entrapping the drug in 
Kposomes or microemulsions which are compatible with body tissues. 

The injectable formulations can be sterilized, for example, by filtration through a bacterial 
5 retaining filter, or by incorporating sterilizing agents in the form of sterile solid compositions 

which can be dissolved or dispersed in sterile water or other sterile injectable medium just prior 
to use. 

Solid dosage forms for oral administration include, but are not limited to. capsules, tablets, 
pills, powders, and granules. In such solid dosage forms, the active compounds are mixed with 
.0 at least one item pharmaceutically acceptable excipient or carrier such as sodium citrate or 
dicalcium phosphate and/or a) fillers or extenders such as starches, lactose, sucrose, glucose, 
mannitol, and silicic acH b) binders such as. for example, caiboxymethylcellulose, alginate^, 
gelatm, polyvinylpyrrolidone, sucrose, and acacia, c) humectants such as glycerol, d) . 
disintegrating agents such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, 
5 certam silicates, and sodium carbonate, e) solution retarding agents such as paraffin, f) 

absorption accelerators such as quaternary ammonium compounds, g) wetting agents such as, for 
example, acetyl alcohol and glycerol monostearate, h) absorbents such as kaolin and bentonite 
clay, and i) lubricants such as talc, calcium stearate, magnesium stearate, soUd polyethylene 

glycols, sodium lauryl sulfate, and mixtures thereof. In the case of capsules, tablets and pills, the. 
) dosage form can also comprise buffering agents. 

Solid compositions of a similar type can also be employed as fillers in soft and hard fiUed 
gelatin capsules using such excipients as lactose or milk sugar as weU as high molecular weight 
polyethylene glycols and the like. 

The solid dosage forms of tablets, dragees, capsules, pills, and granules can be prepared 
with coatings and shells such as enteric coatings and other coatings well known in the 
pharmaceutical formulating art They can optionaUy contain opacifying agents and can also be 
of a composition that they release the active ingredient(s) only, or preferentially, in a certain part 
of die intestinal tract optionaUy. in a delayed manner. Examples of embedding compositions 
which can be used include polymeric substances and waxes. 

The active compounds can also be in micro-encapsulated form, if appropriate, with one or 
more of the above-mentioned excipients. 

Liquid dosage forms for oral administration include, but are not limited to, 
pharmaceudcaUy acceptable emulsions, solutions, suspensions, syrups and eUxirs. In addition to 
the active compounds, the Uquid dosage forms can contain inert dUuents commonly used in the 
art such as. for example, water or other solvents, solubiUzing agents and emulsifiers such as 
ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate. 
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propylene glycol, 13-butylene glycol, dimethyl foimamide, oils 0n particular, cottonseed, 
groundnut, com, germ, olive, castor, and sesame oils), glycerol, tetrahydrofiirfuryl alcohol, 
polyethylene glycols and fatty acid esters of sorbitan, and mixtures thereof. 

Besides inert diluents, the oral compositions can also include adjuvants such as wetting 
5 agents, emulsifying and suspending agents, sweetening, flavoring, and perfuming agents. 

Suspensions, in addition to the active compounds, can contain suspending agents as, for 
example, ethoxylated isostearyl alcohols, polyoxyethylene sorbitol and sorbitan esters, 
microcrystalline cellulose, aluminum metahydroxide, bentonite, agar-agar, and tragacanth, and 
mixtures thereof. 

10 Alternatively, the composition can be pressurized and contain a compressed gas, such as 

nitrogOT or a liquefied gas propellant. The liquefied propellant medium and indeed the total 
composition is preferably such that the active ingredients do not dissolve therein to any 
substantial extent The pressurized composition can also contain, a smface active agent The 
surface active agent can be a liquid or solid non-mnic surface active agent or can be a solid 
i5 anionic surface active agent It is preferred to use the solid anionic surface active agent in the 
form of a sodium salt. 

The compositions of the present invention can also be administered in the fonn of 
liposomes. As is known in the art, liposomes are generally derived from phospholipids or other 
lipid substances. Liposomes are formed by mono- or multi-lamellar hydrated liquid crystals that 
.0 are dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and ■ 
metabolizable lipid capable of forming liposomes can be used. The present compositions in 
liposome form can contain, in addition to the compoimds of the invention, stabilizers, 
preservatives, excipients, and the like. The preferred lipids are the phospholipids and the 
phosphatidyl cholines Oecithins), both natural and synthetic. Methods to form liposomes are 
5 known in the art (sec, for example, Prescott, Ed., Meth. Cell Biol. 14:33 et seq (1976)). 

One of ordinary skill wiD appreciate that effective amounts of the agents of the invention 
can be determined empirically and can be enqiloyed in pure form or, where such forms exist, in 
pharmaceutically acceptable salt, ester or prodrug form. The agents can be administered to a 
subject, in need of treatment of viral infection, as pharmaceutical compositions in combination 
0 with one or more pharmaceutically acceptable excipients. It will be understood that, when 

administered to a human patient, the total daily usage of the agents or composition of the present 
invention will be decided by the attending physician within the scope of sound medical 
judgement The specific therapeutically effective dose level for any particular patient will 
depend upon a variety of factors: the type and degree of the cellular or physiological response to 
5 be achieved; activity of the specific agent or composition employed; the specific agents or 

composition employed; the age, body weight, general health, sex and diet of the patiwit; the time 
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of administration, route of administration, and rate of excretion of the agent; the duration of the 
treatment; drugs used in combination or coincidental with the specific agent; and like factors 
well known in the medical arts. For example, it is well within the skiU of the art to start doses of 
the agents at levels Iowct than those required to achieve the desired therapeutic effect and to 

5 gradually increase the dosages until the desired effect is achieved. 

Dosing can also be arranged in a patient specific manner to provide a predetermined 
concentration of the agents in the blood, as determined by techniques accepted and routine in the 
art Urns patient dosaging can be adjusted to achieve regular on-going blood levels, as measured 
by HPLC, on the order of fiom 50 to 1000 ng/ml. 

10 It wUl be readily apparent to one of ordinary skill in the relevant arts that other suitable 

modifications and ad^tations to the methods and applications described herein can be maHp 
without departing from the scope of the invention or any embodiment thereof. 

The modified siRNA is prepared by custom chenoical synthesis by Dharmacon, at 
Lafayette CO. Each C and U within the siRNA duplex (GL2), has been substituted with 2’-F-U • 
.5 and 2 -F-C except for the 3’-end overhang, which was dTdT. 

To test the stability of 2' chemically modified siRNA compared to unmodiGed siRNA 
(siRNA), the following experiment is performed. 4ngs of siRNA are added to a 20 pL volume of 
80% human serum from a healthy donor. This mixture is incubated at 37°C for various times 
ranging fiom 1 minute up to 10 days. The same process is performed for 2' fluorine modified 
0 SiRNA (2’-F SiRNA). When the incubation process is finished, the mixtures are placed on ice and 
tiien immediately separated by PAGE along with a ^¥-siRNA control. The 2’ modified siRNA 
is stable as compared to unmodified siRNA. 

V. IDENTIFICATION OF THERAPEUTICALLY ACTIVE AGENTS FOR TREATMENT 
OF SARS VIRUS INFECTION 

5 The invention provides methods for treating SARS by administering therapeutically active 

agents, such as smaU molecule compounds, to a mammal, as weU as methods of identifying 

therapeutically active agents, such as potent smaU molecules, for the treatment of SARS virus 
infection. 

In one aspect of the invention a method of identifying a therapeuticaUy active agent is 
) provided comprising: (a) contacting the therapeuticaUy active agent with a ceU infected with the 
SARS virus; (b) measuring attenuation of a SARS related enzyme. 

In a more particular embodiment, the therapeuticaUy active agent is a smaU molecule. In 
another more particular embodiment, the therapeuticaUy active agent is a nucleoside analog (e.g. 
Ribavirin). In another more particular embodiment the smaU molecule is a SMIP or peptidic 
; immunomodulating compound. In another more particular embodiment the therapeuticaUy 
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active agent is a pq)toid, oUgopeptide, or polypeptide. In another embodiment the SARS related 
enzyme is SARS protease. In another embodiment the SARS related enzyme is SARS 
polymerase. In still another embodiment the SARS related enzyme is a kinase. In still another 
embodiment, the SARS related enzyme is a protease. The finin inhibitor peptidyl 
5 chloromethylketone prevents blocks cell-ceU fusion after MHV infection (de Haan et aL (2004) J 
Virol), which offers guidance for SARS therapy. 

The invention includes a cell-based assay that can be used to screen for and identify a 
therapeutically active agent for the treatment of SARS virus infection. Therapeutically active 
agents of the invention include agents that inhibit, prevent or reduce the replication of a SARS 
10 virus. Such agents can be identified by infecting a cultured cell (such as, for example, VERO 
cells) with a SARS virus and evaluating the impact of potential antiviral compounds on SARS 
virus replication. Assays to measure the effect of a potential antiviral compound on virus 
replication are known in the art and may be based on a variety of parameters. 

The cell-based assay may be used in a higb-throughput screen to identify therapeutically 
15 active compounds fi^’om chemical libraries comprising potential antiviral compounds. 

Therapeutically active compounds suitable for use in the invention may inhibit any SARS viral 
target that is essential for viral replication in whole cells. Efficacy (the ability of a compound to 
inhibit or inactivate the target, be it viral or cellular, that results in the reduction of viras in the 
culture) of the therapeutic agent is measured by assessing the viability and/or the proliferation of 
10 surviving cells in a SARS virus infected cell culture. 

A number of mefliods can be used to measure cell viability are known in the art, such as 
assays measuring cellular enzymes, proteins, nucleotide triphoshates (e.g. ATP), nucleic acids 
(e.g. host cell mRNA (e.g. GAPDH) or rRNA sequences) or cellular metabolites such as MTT or 
MTS. In addition, fluorescent (including, for example HSV p^er) or non-fluoiescent dyes (e.g. 
t5 propidium diiodide) or labeling of DMA can be used to measure indications of cell viability 
and/or proliferation. 

Alternatively, efficacy of a compound or sample can be determined by directly measuring 
the amount of virus or viral products in the culture. Methods for measuring the amount of virus, 
viral genome or viral products include: PCR, RT-PCR, TMA, reporter proteins with fluourescent 
0 or luminescent qualities or enzymatic functions (e.g., ludfoase, alkaline phosphatase, GFP) or 
proteins that can be detected by antibodies (e.g. EOF) that mi^t be incorporated into the viral 
genome prior to infection of the cell culture. Further, viral products such as viral proteins can be 
measured by ELISA or enzymatic activities. Methods for identifying viral polynucleotides, viral 
proteins and ^tibodies specific to viral proteins are disr-us se d above. 

5 Potential antiviral compounds are applied to the cell-based assay at a concentration of 

approximately 10 fiM and compound classes having therapeutic effect are identified by 
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measuring the parameter of choice (such as cell viability/proliferation or the virus or viral 
genome or a viral product be it viral in origin or non-viius in orgin). Once compounds are 
identified as having activity, they are resynthesized, and analoged. Starting with the identified 
confound, many analogs and new compounds are synthesized during consecutive optimization 
5 cycles of synthesis, biological profiling and modeling techniques to optimize the to the lead 
structure until in vivo activity is elucidated and optimized 

Cells suitable for use in the assay include the cells described above as suitable for vaccine 
production. Preferably, the cells arc Afiican green monkey kidney cells (Vero) cells. Human 
embyronic lung fibroblasts or normal human diploid fibroblasts may also be used in the 
10 invention. 

In one embodiment, the invention includes a fluorescence based cytopathogenicity assay to 
measure the effect of a potential antiviral compound on a cell-based assay. One example of a 
fluorescence based cytopathogenicity assay is illustrated below. 

1 X lO'* Veto cells per well of a microtiter plate (MTP) are infected with a defined amount 
15 of SARS virus selected within the following ranges for optimal MOt 5-10, 10-25, 25-50, 50- = ■ 
100, 100-500, or 500-1000 PFU in a total volume of 200 fil media (M199 medium supplemented 
with 5% PCS, 2 mM glutamine, 100 lU/ml penicillin and 100 fig/nd streptomycin) in the 
presence or absence of the potential antiviral compound and incubated for at least 1, 2, 3, 4, 5, 6, 
or 7 days at 37°C, 5% CO 2 . The wells of the MTP are washed with PBS (200 fil) and then filled 
50 wifli 200 III PBS containing 10 iig/ml fluorescein diacetate. After a 45-min incubation at room 
temperature, fluorescence is measured at 485 nm excitation and 538 nm emission wavelengths. 
IC50 values arc determined by a nonlinear plot of antiviral activity as a function of drug 
concentration. 

Other cell based assays are known in the art and include, among others, methods of GFP 
,5 detection and Luc detection. In addition, a Promega kit is commnciaUy available that provides 
additional methods of measuring cell viability, etc. 

fit one embodiment, the invention includes a method of measuring the efficacy of a 
potential antiviral compound using RT-PCR to detect the levels of SARS viral RNA in the cell 
based assay. Methods of using RT-PCR are known in the art One example of such an assay is 
0 described below. 

5 X 10® Vero ceDs are seeded in tissue culture. Flasks containing the ceUs are incubated 
over night at 37°C, 5% CO 2 . The cells are infected (m.o J. = 1) with SARS virus in the presence 
and absence of potential antiviral compounds. Optionally, the cells may be pretreated with the 
potential compound prior to infection. In either case, a suitable control cell assay is also 
5 prepared. 



- 213 - 




wo 2004/092360 



PCT/US2004/011710 



The RNA of infected cells is purified at 2 h (UL54), 12 h (UL8) and 16 h (UL13) after 
infection, (Qiagen) RNA purification (RNeasy kit; 40 nl elution) and quantified (absorption at 
260 nm). The RNA (2 /tg) is reverse transcribed with a specific primer (2 pmol, using one of the 
primer pairs described herein) into cDNA according to theSuperscript II protocol (hivitrogen). 

5 Aliquots (2 /rl) of the reverse transcription reaction are amplified by PCR. Fragments of the 
appropriate target SARS gene, Lc., a gene encoding a SARS enzyme, are anipiifi^ in 30 cycles 
(UL54 and UL8: 3 min, 94°C hot start; 1 min, 94°C denaturation; 1 min, 55°C annealing; 1 min, 
72 °C polymerization. UL13: 3 min, 94 “C hot start; 1 min, 94 “C denaturation; 1 min, 60 °C 
anneabng, 1 min, 72°C polymerization) by PCR (Taq-Polymerase, Stratagene), in a 100-/il 
.0 reaction volume with the appropriate oligonucleotides, as described herein at 0.1 nmol each. 8-/il 
aliquots of cycle 20-30 Qanes 2-12) of the PCR were resolved on a 2% agarose gel (Invitrogen) 
according to the manufacturer’s instractions. 

Cell-based assays of the invention may optionally use a variant or derivative of a wild-type 

SARS virus that has reduced or attenuated virulence in humans and/or animal models (e.g., 

5 mouse, non-human primate, etc.) Use of such attenuated SARS viruses in screening methods 
may reduce srfety concerns and precautions that would otherwise be associated with the 
pathogenic nature of the SARS virus and may eliminate or reduce the need for the 
implementation of cumbersome high contairunent levels during perfotmance=of the assays and • 
screening of compounds. 

O’ The invention includes an enzyme-based assay that can be used to screen for and identify a 

therapeutically active agent for the treatment of SARS virus infection. 

An embodiment of the invention is an assay comprising contacting a known quantity of 
SARS protease in solution to a peptide contakung a detectable marker and cleavage site for 
SARS protease, wbraein SARS protease activity is monitored by measuring the intensity of the 
5 marker on the cleaved product. 

In a more particular embodiment, a method of assaying for SARS protease is provided 
comprising contacting a sample solution containing SARS protease with a peptide containing a 
fluorescent donor, fluorescent quencher, and cleavage site for SARS protease, said peptide being . 
detectable with a fluorometer when cleaved, wherein SARS protease activity is determined in the 
5 sample by the amount of fluorescence detected by the fluorometer. 

Assays based on the direct measurement of SARS protease inhibition may be utilized for 
screening for SARS therq>eutics. Protease for such assays such as 3C-like protease and papain- 
like protease may be isolated and purified for such assays as described in Seybert, et al„ J. Gen. 
Virol., 78:71-75, 1997, Ziebuhr, et aL, Adv. Exp. Med. Biol., 440:115-120, 1998, Sims, et al., 

) Adv. Exp. Med. Biol. 440: 129-134, 1998, Ziebuhr, etaL,Z. Virol., 73:177-185, 1999, Teng, et 
al., J. Virol., 73:2658-2666, 1999, Herold, et-oL, J. Biol. Chem. 274:14918-14925, 1999, and 
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Ziebuhr, e/ a/., J. Biol. Chem. 276:33220-33232, 2001. Furthennorc, Example 30 describes a 
novel mediod of purif^ng SARS protease using column chromatography. Example 31 describes 
a continuous fluorescence resonance energy transfer (FRET) assay for measuring SARS protease 
activity. Protease enzyme based assays such as the FRET assay demonstrated in Example 31 are 
5 readUy adapted for high-throughput screening and are used for screening candidate antiviral 
compounds. Performance of the protease enzymatic assay in the presence of a SARS protease 
inhibitor compound will show a decreased amount of fluorescence at a given fimft when 
compared to negative control assay containing no test compound on a non-inhibiting control 
compound. Such a method would involve the steps of: (a) providing an assay solution 

10 comprising SARS protease; (b) adding a test compound to the assay solution; (c) adding a 

substrate for SARS protease to the assay solution; and (d) measuring the proteolytic activity in 
the assay solution. In a preferred embodiment, the proteolytic activity is measured by the 
fluorescence of fluorophore product produced by the enzymatic activity of SARS protease. 

Attenuated SARS virus variants generally contain one or more genome modifications or 
1 5 mutations (e.g . , substitutions, deletions, insertions) in protein encoding or non-coding regions. 

Specific examples of attenuating mutations include, for example, genetic modifications in the 5% 
end noncoding region, leader sequoice, intergenic regions, 3'-end noncoding region, ORF la, 
ORF lb, S gene, E gene, M gene, N gene, or any of the nonstructuial protein genes outside of the 
ORF la/lb region. Preferred attenuating mutations are in a SARS virus structural protein (e.g., 
iO Spike (S)), a protease or polymerase domain, or a non-coding sequence (e.g . , 5'-end noncoding 
region, intergenic sequence). In addition, a cleavage site may be introduced or eliminated within 
the spike protein (see for example, Gombold et oL, J. Virol. 67:4504-4512, 1993; Bos etal. 
Virology 214:453-463, 1995), such modification that may also be useful for optimization of 
expression of recombinant spike protein antigen (e.g . , for vaccine purposes). 
t5 A variety of methods are used according to the present invention in order to obtain 

attenuated variants of SARS virus. Such methods include serial passage of the SARS virus in 
cultured cells (e.g . , ma m malian cell culture, such as fetal rhesus kidney cells or VERO cells), 
until the SARS virus demonstrates att^uated function. The serial propagation of virus may be 
performed at any temperature at which tissue culture passage attenuation occurs, and may be 
■0 performed in conjunction with one or more steps of mutagenesis (e.g., chemical mutagenesis). 

The attenuated phenotype of SARS virus variants, obtained rfter one or more cell culture 
passages, is readily measured by one skilled in the art. As used herein, attenuation refers to the 
decreased virulence of the SARS virus in a human subject Evidence of attftn n gtpd function may 
be indicated by decreased levels of viral replication or by decreased virulence in an animat 
5 model. 
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Otiier methods of producing an attenuated SARS vims include cell culture passage of the 
virus at sub-optimal tempmntures (cold passage), as well as introduction of attenuating mutations 
into the SARS viral genome by random mutagenesis (c.g., chemical mutagenesis, such as using 
5-fluorouracil) or using directed mutagenesis. Preparation and generation of attenuated RS V 
5 vaccines (the methods of which will genraally applicable to SARS virus) are disclosed in, for 
example, EP 0640128, US Patent No. 6,284,254, US Patent No. 5,922326, US Patent No. 
5,882,651. 

The number of passages required to obtain safe, immunizing attenuated virus is dependent 
at least in part on the conditions employed. Periodic testing of the SARS virus culture for 
10 virulence and immunizing ability in animals (e.g., mouse, primate) can readily determine the 
parameters for a particular combination of tissue culture and temperature. 

In another embodiment, the cell-based assay for screening of antiviral compounds is based 
on the readout of expression of a gene product (e.g . , reporter gene product) that is not from 
SARS virus. Gene products particularly suitable to the present invention include, but are not 
15 limited to those of the above-described assays. 

In order to achieve such a read-out, the gene-of-interest (GOI) encoding said gene reporter 
gene product must be incorporated into a replicating SARS virus genome or construct derived 
from a SARS virus genome (e.g., SARS virus replicon, SARS virus defective-interfering (DI) 
RNA). Rgure 13 is a schematic depicting locations for incorporation of the reporter gene into a 
20 SARS virus genome. Preferably, insertion of a heterologous reporter gene-of-interest is at a site 
between existing SARS virus genes, such as for example, as shown in Rgure 13. For example^ . 
the GOI may be inserted closely following the termination codon of a SARS virus gene (e.g., 
ORF lb, S, E, M, N). Insertion should be positioned in order to minimize disruption of mRNA 
transcription for the SARS virus gene(s). The GOI may also be inserted as an in-frame “fusion” 
25 with an existing SARS virus gene, such that sufficient function of the GOI is maintained for 
detection. To optimize expression, an additional SARS virus intergenic sequence (e.g., SEQ ID 
NO: 7388, with or without additional flanking SARS virus sequences) may also be engineered 
into a position preceding the inserted GOI. 

Incorporation of a GKDI into SARS virus may be accomplished by one of skill in the art 
10 using a variety of techniques. For example, one preferred method is targeted RNA 
recombination, that takes advantage of the ability of coronavirus RNAs to undergo 
recombination within the cell (see for example Rscher et al., J. Virol. 71:5148-5160, 1997; 
Koljesar et al., J. VeL Sci. 2:149-157, 2001). A construct of desired configuration (e.g., cDNA 
of defective interfering RNA of SARS virus) containing the GOI flanked by SARS virus 
•5 sequence (e.g., intergenic sequence) is generated such that RNA may be transcribed directly 
within a eukaryotic cell or in vitro and transfected into susceptible cells also infected with SARS 
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vims. Recombinant vims containing the GOI is identified based on expression of the GOT 
encoded marker. 

Alternatively, incorporation of a GOI into SARS vims may be accomplished by one of 
skill in die art by first assembling a full-length cDNA clone of the SARS vims, that can be used 
5 to produce infectious RNA transcripts in vivo (e.g. , from an RNA polymerase II promoter) or in 
vitro (e.g., fiom a bactedophage promoter). Although relatively long in genome length, such 
assembly of a full-length cDNA clone is now readily obtainable by one of skill in the art using 
standard molecular biology and reverse genetics techniques and the genome sequence of SARS 
virus (see for exarr^le, Thiel et al., J. Gen. Virol., 82:1273-1281, 2001; Almazan et al., Proc. 

10 Nad. Acad. Sci. USA 97:5516-5521, 2000; Thiel et al. (2003) J Gen Virol 82:1273-1281; Yount 
et al (2003) PNAS USA 100:12995-13000). Insertion of a heterologous GOI into a full-length 
SARS virus genome cDNA may be performed using a variety of techniques, such as for 
example, ligation into natural or synthetic restriction sites, PCR (e.g., overlapping PCR), and 
recombination. 

L5 It may also be desirable to utilize similar SARS vims recombinants containing a gene-of- 

interest for antiviral screening, however, with further modification to minimiz e or eliminate 
virus-induced cytopathology (e.g., CPE). Non-cytopathic derivatives fiom SARS vims may be 
obtained by one of skill in the art using a variety of methods. For example, a selectable marker 
(e.g. , drag resistance marker) may be incorporated as GOI into a SARS virus genome to produce 
20 infectious vims as described above (see for example, Perri et al., J. Virol., 74:9802-9807, 2000). 
Infectious GOI-containing SARS vims or infectious ^nome RNA/cDNA is then used to 
infect/traosfect cells (e.g. , VERO), with or without prior mutagenesis, after which time the 
infected cells are subjected to the appropriate selection. Only those cells containing SARS vims 
harboring both die selectable maricer and one or more mutations rendering the virus non- 
25 cytopathic will survive the selection process and grow out Active SARS vims replication in 
these cells is readily detected using a variety of detection techniques (e.g., PCR, Northern blot) 
and such cells may serve as the substrate for cell-based screening assays. Mutations that result in 
the desired noncytopathic SARS virus phenotype may include nucleotide substitutions, deletions 
or additions, and may occur in a variety of genome coding or non-coding regions (e.g., 5' or 3’- 
30 end noncoding regions, intergenic regions, ORFla, ORFlb, a protease domain, a polymerase 

domain). The identification of such mutations is readily accomplished by exchange of sequences 
with wild-type (e.g., parental) SARS virus and demonstrating transfer of the phenotype, and 
sequencing of the appropriate genome region. S imil ar mutations that reduce or e liminate 
cytopathogenicity also may be utilized in the context of a SARS vims derived replicon vector, 

35 either by similar selection directly using a SARS virus replicon or by specific engineering of the 
replicon based on mutation(s) identified in the context of infectious SARS vims as described 
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above. In addition, such mutations may serve as the basis for attenuated SARS virus derivatives, 
as described elsewhere in this document 

Alternatively, rather than using infectious SARS virus or its derivatives for cell-based 
screening assays, propagation defective “replicons” may be engineoed and Such 

5 replicons maintain ah protein encoding sequraces and cis replication sequences required for 
RNA replication and expression within a cell, but are deleted of one or more sequences or genes 
required for packaging of progeny SARS virus (see for example Curtis et ed., J. Virol., 76: 1422- 
1434, 2002). Rgure 14 is a schematic depicting representative examples of SARS virus 
replicons according to the present invaition. For example a SARS virus cDNA construct is 
10 generated, that is lacking one or more (or all) structural protein encoding genes, whraeby the 
missing SARS virus gene(s) is/are replace by the (^I, maintaining all necessary transcription 
signals for expression of the GOL Operably linked with the SARS virus replicon cDNA 
constract is a promoter for RNA polymerase that can be used to transcribe the replicon RNA in 
vivo (e.g., RNA polymerase II promoter) or in vitro (e.g., bacteriophage promoter). The SARS 
5 replicon may be introduced into a susceptible cell by transfection as RNA or DNA, depending on - 

the promoter of choice, and the transfected cells may be utilized for the evaluation of antiviral 
compounds. By incorporating one or more mutations rendering the replicon noncytopathic for 
the cells (see above), one can avoid the need for nucleic acid transfection each time an assay is to 
be performed. 

•0 Alternatively, SARS virus replicons may be packaged into virus like particles that allow 

mfection of cells, rather than requiring transfection of nucleic acid molecules. A requirement for 
replicon packaging is that essential SARS virus gene functions deleted ftom the replicon (e.g., 
one or more stmctural proteins) are provided in trans within the cell containing the replicon. A 
variety of methods for packaging of replicon RNA can be utilized to one of skill in the art (see ' 

5 for example, Curtis et al., ibi± Ortego, et al., J. Virol., 76:11518-11529, 2002). For example, 
stably transformed cell lines constitutively or inducibly expressing the required SARS virus gene 
functions may be utilized. Alternatively, the required SARS virus gene functions may be 
expressed by viral vectors that are introduced into the replicon-contaming cell. Alternatively a 
defective intetfming (DI) SARS virus derived RNA containing the required gene functions ma y 
D be introduced into the replicon-containing cell. Such DI constructs used to complement missing 

replicon functions may be more commonly referred to as defective helper RNA or defective 

helpers. 

Another configuration useful for cell-based antiviral screening assays according to the 
present invention utilizes SARS virus derived DI RNAs wicoding a GOI (see for example 
) Stirrups, et oL, J. Gen. Virol., 81:1687-1698, 2000; Liao, et al.. Virology 208:319-327, 1995). 
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btroduction of a SARS DI, eithw as cDNA linked to an RNA polymerase II promoter or as in 
vitro transcribed RNA, into susceptible cells also infected with SARS virus, allows for a readout 
of the GOI reporter product in assays. 

A replicon-based system for rapid identification of coronavirus replicase inhibitors is 
5 described by Hertzig et cd. (2004) J Gen Virol DOI 10. 1099/vir/0/80044-0. Briefly, the system 
uses a non-cytopathic selectable replicon RNA that can be stably maintained in eukaryotic cells. 
ITie replicon RNA mediates reporter graie expression as a marker for coronavirus replication, 
and expression of the reporter can be used to test the inhibitory effect of test compounds in vitro, 
thereby allowing high throughput screening for replicase inhibitors without the need to grow 
10 infectious virus. Preferred replicon RNAs include a neomycin resistance gene in the replica^ 
gene with a downstream reporter gene (e.g. GFP) that is expressed via replicase-mediated 
synthesis of a sub-genomic mRNA. 

VI. COMPOSITIONS AND METHODS FOR TREATMENT OP SARS VIRUS INFECTION 
The present invention relates to compositions and mediods for the treatment and/or prevention of 

15 SARS. The invention further includes a method for the treatment and/or prevention of SARS 

through the administration of a therapeutically effective amount of at least one antiviral, 
compound from among those described in the US Patents and published international patent 
applications listed in Table 1 and Table 2. In one embodiment of the method, the antiviral 
compoimd is a small molecule. In another embodiment, the antiviral compound is a protease 
50 inhibitor. In a further embodiment, the antiviral protease inhibitor is a 3C-like protease inhibitor 
and/or a papain-like protease inhibitor. Combined treatment with the lopinavir/ritonavir 
^aletra) protease inhibitor and ribavirin has shown a favorable clinical response (Chu et al. ; 

(2004) Thorax 59;252-256). In another embodiment, the antiviral compound is an inhibitor of an 
RNA dependent RNA polymerase. In another embodiment, a first antiviral compound that is a 
55 protease inhibitor is administered with a second antiviral compound that is an RNA-dependent 
RNA polymerase inhibitor. The invention further provides for the administration of a steroidal 
anti-inflammatory drug in combination with at least one antiviral compound, for example, fiom 
the antiviral compounds described in the documents listed in Table 1 and Table 2. A 
combination treatment of steroids and ribavirin has been described by Fujii et al. (2004) J Infect 
iO Chemother 10:1-7. A combination treatment of corticosteroids and interferon alfacon-1 has also 
been reported (Loutfy et aL (2003) JAMA 290:3222-3228). 

The invention further provides for a method for the treatment and/or prevention of SARS 
throu^ the administration of a therapeutically effective amount of at least one antiviral 
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compound from among those described in the US Patents and published international patent 
applications listed in Table 1 and Table 2 by inhalation. In another aspect, the antiviral 
compound may be administered in combination with a SMIP, SMIS, or other 
immunomodulatory compound such as those in Table 34 and in Table 35. Jn one embodiment of 
5 the method, the antiviral compound is a small molecule. In another embodiment, the antiviral 

compound is a protease inhibitor. In a further embodiment, the antiviral protease inhibitor is a 
3C-like protease inhibitor and/or a papain-like protease inhibitor. In another embodiment, the 
antiviral compound is an inhibitor of an RNA dependent RNA polymerase. In another 
embodiment, a first antiviral compound that is a protease inhibitor is administered with a second 
10 antiviral compound that is an RNA-dependent RNA polymerase inhibitor. The invention further 

provides for the admimstration of a steroidal anti-inflammatory drag in combination with at least 
one antiviral compound, for exan^le, from the antiviral compounds described in the documents 
listed in Table 1 and Table 2. The steroidal anti-inflammatory drug may be administered by 
inhalation for a local effect or administoed for systemic absorption such as via an oral or 
15 intravenous route. 

, The invention further provides for methods for treating SARS infection com prising 
administering a small molecule immunopotentiatar (SMIP) compound either alone or in 
combination with an antiviral compound or in combination with a SARS vaccine. In a further 
embodiment, the SMIP is a compound disclosed herein or set forth in Table 34. 

10 The invention further provides for methods for treating SARS infection comprising 

administering an immunosuppressant compound, optionally a small molecule suppressant 
(SMIS) compound either alone.or in combination with an antivird compound. In a further 
embodiment, the immunosuppressant compound is disclosed herein or set forth in Table 35. 

The invention further provides peptidic immunomodulating compositions, that include 
'5 oligo and polypeptides, capable of effecting inflammatory response in a patient In one 

embodiment the peptidic immunomodulating composition is able to stimulate human cells to 
produce cytokines. In another embodiment the peptidic inununomodulating composition is 
capable of decreasing cytokine levels in the human. Preferred Examples of peptidic 
immunomodulating compositions include those listed in Table 35, as well as TGFP2, TGppi, 

0 TGFp3, thymopentin (TP5), P-mercaptopropionyl-arginyl-lysyl-aspartyl-valyl-tyrosyl-cysteine 
amide, colostrinine, lactoferrin (LF)> cyclolinopeptide A (C3^), and tuftsin (TKPR). The 
peptidic inununomodulating compositions of the invention may be used alone or in combination 
with other agents, preferably antiviral compounds, for the treatment of SARS. 
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TTie invention furthra* provides for a kit for use by a consumer for die treatm^t and/or 
prevention of SARS. Such a kit comprises: a) a pharmaceutical composition comprising a 
therapeutically effective amount of at least one antiviral, SMIP, SMIS, or other 
immunomodulating compound from among tiiose described in the US Patents and published 
5 international patent plications listed in Table 1, Table 2, Table 34 and Table 35 and a 
pharmaceutically acceptable carrier, vehicle or diluent; b) a container for holding the 
pharmaceutical composition; and, optionally, c) instructions describing a method of using the 
pharmaceutical compositions for the treatment and or the prevention of SARS. TTie kit may 
optionally contain a plurality of compounds for the treatment of SARS wherein the antiviral 
10 compounds are selected from 3C-like protease inhibitors and papain-like protease inhibitors. In 

a further embodiment, the kit contains an antiviral compound that is an RNA-dependent RNA 
polymerase inhibitor. When the kit comprises more than one antiviral, SMTP, SMIS, or other 
immunomodulating compoimd, the compounds contained in the kit may be optionally combined 
in the same pharmaceutical composition. 

15 An additional aspect of the invention provides for the use of at least one of the antiviral, 

SMIP, SMK, or other immunomodulating compounds described in the US Patents and published 
international patent apphcations listed in Table 1, Table 2, Table 34 and Table 35 for the 
manufacture of a medicament for the treatment or prevention of SARS. 

An additional aspect of the invention provides for the use of at least one SMIP compound, 
iO or at least one immunosuppressant compound, or at least one SMIS compound for the 

manufacture of a medicament for the treatment or prevention of SARS. Preferred SMTP, 
immvmosuppressant, and SMIS compounds are described herein. 

Unless otherwise specified, the following terms, when used within Section VI: 
“Compositions and Methods for Treatment of SARS Virus Infection” of die present application 
15 have the meanings as defined below: 

As used hraein, "limit", “treat” and “treatment are interchangeable terms as are "limiting" 
and "treating" and, as used herein, include preventative (e.g., prophylactic) and palliative 
treatment or the act of providing preventative or palliative treatment The terms include a 
postponement of development of SARS symptoms and/or a reduction in the severity of such 
iO symptoms that will or are expected to develop following infection with a SARS vinis. The terms 
furthCT include ameliorating existing SARS symptoms, preventing additional symptoms, 
ameliorating or preventing the underlying metabolic causes of symptoms. 

Representative uses of the compositions and methods of the present invention include: the 
elimination or reduction of the viral load of the SARS virus in a vertebrate, including humans, 

5 the elimination or reduction of symptoms associated with SARS, and a reduction in morbidity 
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associated with SARS. In a SARS patient population, the use of the conapositions and methods 
of the invention will result in the reduction in the high mortality rates associated with SARS. 

Infection with the SARS virus and the symptoms associated with SARS can be treated in a 
subject by administering the compositions of the invention. The compositions of the invention 
5 may be administered systemicaUy. For systemic use, the compounds hojein are formulated for 

parenteral (e.g., intravenous, subcutaneous, intramuscular, intraperitoneal, intranasal or 
transdermal) or enteral (e.g., oral or rectal) delivery according to conventional methods. 
Intravenous administration can be by a series of injections or by continuous infusion over an 
extended p^od. Administration by injection or other routes of discretely spaced administration 
10 can be performed at intervals ranging from weekly to once to three times daily or more. 
Alternatively, the compositions disclosed herein may be administered in a cyclical maimer 
(administration of disclosed composition, followed by no administration, followed by 
administration of disclosed compositions, and the like). Treatment will continue until the desired, 
outcome is achieved. 

15 A “subject” is a vertebrate animal including a human that is in need of treatment with the 

compositions, inethods and kits of the present invention. The term “subjecf ’ or “subjects” is 
intended to refer to both the male and female gender unless one gender is specifically indicated. 

"Coadministration" of a combination of a plurality of antiviral compounds means that these 
. components can be administered together as a composition or as part of the same, unitary dosage 

20 form. "Co-administration" also includes administering a plurality of antiviral compounds 

sqiarately but. as part of the same therapeutic treatment program or regimen. "Co-administration" 
also includes administering a plurality of other agents, such as, for example an oligopeptide, a 
polypeptide, a peptidic immunomodulator, nucleic acid, antibodies, or a vaccine wherein the 
compounds or agents are administered separately but as part of the same therapeutic treatment ' 
25 program or regimen. The components need not necessarily be administered at essentially the 
same time, although they can if so desired. "Co-administration" also includes sqiarate 
administration at different times and in any order. For example, where appropriate a patient may 
take one or more component(s) of the treatment in the morning and the one or more of the oth^ 
component(s) at night 

30 By "antiviral compound" as' used herein is meant an antiviral compound as described in the 

US Patents and published international patrait applications listed in Table 1 and Table 2. The US 
Patents and published international patrait applications listed in Table 1, Table 2 and Table 35 
are incorporated herein in their entirety. In one embodiment, the antiAdral compound is an RNA- 
dependent RNA polymerase, hi another preferred embodiment the antiviral compound is a 3C- 
35 like protease inhibitor or a p^ain-like protease inhibitor. The antiviral compounds may be 
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10 



15 



administered in (he form of the add, or of a soluble alkali metal salt or alkaline earth metal salt •' 
where appropriate. 

The precise dosage of the antiviral compound will vary with the dosing schedule, the oral 
potency of the particular antiviral compound chosen, the age, size, sex and condition of the 
subject, the severity of the disorder to be treated, and other relevant medical and physical factors. 
Thus, a predse pharmaceutically effective amount cannot be specified in advance and can be 
readily determined by the caregiver or clinidan. 

Generally, an appropriate amount of antiviral compound is chosen to obtain a reduction in 
the load of the SARS virus in the subject and/or to obtain a reduction in the symptoms assodated 
with SARS. For humans, an effective oral dose of antiviral compound is typically from about 
1.5 to about 6000 /xg/kg body weight per day and preferably about 10 to about 2000 /tg/kg of 
body weight per day. 

One of ordinary skill in the art will recognize that certain antiviral, SMIP, SMIS, and 
immunomodulaling compounds of the invention including 3C-like protease inhibitors, p£q>ain- 
like protease inhibitators, and RNA-dependent RNA polymerase inhibitors will contain one or 
more atoms that may be in a particular st^eochemical, tautomeric, or geometric configuration, 
giving rise to stereoisomers, tautomers and configurational isomers. All such isomers and 
mixtures thereof are included in this invention, when active. Crystalline and amorphous forms of 
the antiviral compounds of Ihis invention are also included as are hydrates, solvates, polymorphs, 
and isomorphs of the antiviral compounds of the invention. 

SMIP compounds of the invention include compounds are described in issued U.S. Patoit 
Nos. 4,547,5 1 1 and 4,738,97 1 with the general structure (a): 




a 

25 for the treatment of disorders responsive to agents that enhance cell-mediated immunity. 

Immunostimulatory oligonucleotides and polynucleotides are described in PCT WO 
98/55495 and PCJT WO 98/16247. U.S. Patent Application No. 2002/0164341 describes 
adjuvants including an unmethylated CpG dinucleotide (CpG ODN) and a non-nucleic add 
adjuvant. U.S. Patmit Application No. 2002/0197269 describes compositions comprising an 
10 antigen, an antigenic CpG-ODN and a polycationic polymer. 
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Additionally, issued U.S. Patent Nos. 4,689,338, 5,389,640, 5,268,376, 4,929,624, 
5,266,575, 5352,784, 5,494,916, 5,482,936, 5,346,905, 5395,937, 5338,944, 5325,612, 
W099^9693 and U.S. Ser. No. 09/361,544 disclose confounds of die geno:al structure (b): 

R R' 



w 




for the use as “immune response modifiers.” 

Further compounds with SMTP and antiviral activity are described below and in US 
Patent Application entided Thiosemicaibazones as Anti-Virals and finmunopotentiators filed on 
December 29, 2003 ivith an attorney docket number of PP19814.004US generally Hisr.lnging 
10 compoimds of the following structures; 

A compound of formula c: ' 



v 

Y" 



I I 

R" Z’ 



c 

wherein: E is absent or selected from the group consisting of alkyl, substituted alkyl, 

5 cycloalkyl, substituted cycloalkyl, aryl, substituted aryl, heterocyclyl, substituted heterocyclyl, 
heteroaryl, and substituted heteroaryl; 

L is absent or is selected from the group consisting of oxo, amino, alkylene, substituted 
alkylene, alkoxy, alkylamino, aminoalkyl, heterocyclyl, carbocyclyl, and carbonyl; 

W is absent or selected from the group consisting of cycloalkyl, substituted cycloalkyl, 
0 aryl, substituted aryl, heterocyclyl, substituted heterocyclyl, heteroaryl, and substituted 
heteroaryl; 

X is absent or is selected from the group consisting of oxo, amino, alkylene, substituted 
alkylene, alkoxy, alkylamino, aminoalkyl, heterocyclyl, carbocyclyl, and carbonyl; 

Y is selected fiom the group consisting of cycloalkyl, substituted cycloalkyl, aryl, 

5 substituted aryl, heterocyclyl, substituted heterocyclyl, heteroaryl, and substituted heteroaryl; 
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Y* is absent or is selected ftom the group consisting of F, Q, Br, I, nitro, alkyl, 
substituted alkyl, and optionally substituted heterocycl^, amino, alkylamino, dialkylamino; 

Y” is absent or is selected from the group consisting of F, Q, Br, I, nitro, alkyl, substituted alkyl, 
and optionally substituted heterocyclyl, amino, alkylamino, dialkylamino; 

5 R’ is H, alkyl, or substituted alkyl; 

R”isH,or 

R’ and R’ ’ are taken together to form a hetercyclic ring; 

Z and Z’ are independently selected firom the group consisting of hydrogen, alkyl, substituted 
alkyl, aryl, substituted aryl, arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, 

10 heteroarylalkyl, substituted heteroarylalkyl, alkoxy, substimted alkoxy, aminocarbonyl, 
alkoxycarbonyl, carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted 
alkylcarbonyl, arylcaibonyl, aralkylcarbonyl, heteroarylcarbonyl, heteroaralkylcaibonyl, 
alkylcaibonyloxy, arylcarbonyloxy, aral^lcarbonyloxy, heteroarylcarbonyloxy, • 
heteroaralkylcaibonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, formyl, 

15 loweralkylcarbonyl, loweralkoxycarbonyl, aminocarbonyl, aminoaryl, aUcylsulfonyl, 
sulfonamido, aminoalkoxy, alkylamino, heteroarylamino, alkylcaibonylamino, 
nlkylaminocaibonylaniino, arylaminocarbonylamino, aralkylcarbonylamino, 
heteroarylcaibonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloknido, 
arylsulfonyl and arylsulfonamido; or 

20 Z and Z’ are taken together to form a heterocyclic group, that may be optionally substituted and 
the tautomers and the pharmaceutically acceptable salts^ esters, or prodrugs thereof. 

. . Further SMIP compounds are described below and in US Patent Application 10/762873, 
Use of Tryptanthrin Compounds for Immune Potentiation, filed on January 21 , 2004 and 
disclosing the general embodiment of compounds represented by Formula (d): 



25 




(d) 



wherein 

A, B, C, D, E, F, G, and H are independently selected from carbon and nitrogen, or A and 
B and/or C and D can be taken together to be nitrogen or sulfur; 
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Rl, R2, R3, R4, R3, and Rjq are ind^ndently selected from the group consisting of hydrogen, 
halogen, low^alkyl, alkyl, substituted alkyl, cycloalkyl, heterocyclyl, alkylheteiocyclyl, 
substituted heterocyclyl, substituted alkenyl, amino, (substituted alkyl)(alkyl)amino, imino, 
haloloweralkyl, hydroxy, alkoxy, substituted alkoxy, hydroxyalkylthio, nitro, alkylsulfonyl, N- 
5 alkylsulfonamide, arylalkyl, arylalkylaryl, arylaiyl, aryloxy, arylamino, acylamino. 

acyloxyamino, alkylaminoacylamino, alkylaminosulfonylamino, alkylamino, alkenylamino, 
dialkylanuno,' alkoxyaDcylarnino, alkoxyalkylheterocyclyl, meicaptoalkoxyalkyl, cyano, 
formyl, -COORjj wherein Rj| is hyrtogen, loweralkyl, aryl, heterocyclyl, monosaccharide or 
disaccharide, and -CONR12R13 wherein Rj2 and R13 are independently selected from hydrogen, 

10 loweralkyl, aryl, heterocyclyl, saccharide, peptide and amino acid residues; or Rj and R3 taken 
together form a six membered aromatic ring; 

R7 and R9 are independently selected from hydrogen, halogen, loweralkyl, 
haloloweralkyl, cycloalkyl, heterocyclyl, substituted heterocyclyl or heterocyclylalkyl; and 
Rl> R2. R3. R4, R?. Rs. R9,and Rjq are absent when the ring atom to which they would 
5 otherwise be bonded is sulfur or double-bonded nitrogen; or 

the a pharmaceutically acceptable salts, esters, or prodrugs thereof, 

provided that Ri, R2, R3, R4, R,, Rg, R,, and Rio are not all hydrogen when A, B, C, E, P, and 
H are carbon. 

In one embodirnent, the compounds of Formula (I) have a backbone structure wherein D is 
0 nitrogen, and A-C and E-H are carbon. 

In one embodiiuent, when D is carbon, at least one, or at least two of Rj - R4 and R7 - Rjq are 
not hydrogen. 

In one embodiment, Rj flirough R4^ and Rg and R^q are independently selected from at least two 
of the group consisting of hydrogen, halogen, loweralkyl, cycloalkyl, heterocyclyl, substituted 
5 heterocyclyl, alkylheterocyclyl, amino, imino, haloloweralkyl, alkoxy, nitro, alkylsulfonyl, 
arylalkyl, arylalkylaryl, arylaryl, aryloxy, arylamino, acylamino, acyloxyamino, 
alkylaminoacylamino, alkylaminosulfonylamino, alkylamino, alkenylamino, dialkylamino, 
alkoxyalkylamino, alkoxyalkylhetraocyclyl, mercaptoalkoxyalkyl, cyano, formyl, -COORji 
where Rj j is hydrogen, lowranlkyl, aryl, heterocyclyl, monosaccharide or disaccharide, 

) and -CONR12R13 where Rj2 and R13 are independently selected from hydrogen, loweralkyl, 

aryl, heterocyclyl, saccharide, peptide and amino acid residues; and R4 is not present when D is 
nitrogen. 
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In an additional embodinient, 4A, B, C, D, E, F, G, and H are independenfly selected fiom 
caibon and nitrogen; 

Rj, R 2 , Rj, R 4 ^ Rs and Rjg are independently selected from die group consisting of 
hydrogen, halogen, loweralkyl, alkyl, substituted alkyl, heterocyclyl, substituted heterocyclyl, 

5 substituted alkenyl, (substituted alkyl)(alkyl)aniino, haloloweralkyl, hydroxy, aDcoxy, substituted 
alkoxy, hydroxyalkylthio, nitro, N-alkylsulfonamide, cyano, -COORji wherein Rn is hydrogen, 
loweralkyl, aryl, heterocyclyl, monosaccharide or disaccharide, and -CONR 12 R 13 wherein R 22 
and Ri 3 are independently selected from hydrogen, loweralkyl, aryl, heterocyclyl, saccharide, 
peptide and amino acid residues. 

10 

For the compounds described herein: ■ 

The term "loweralkyl" refers to branched or straight chain acyclical alkyl groups 
comprising pne.to ten caibon atoms, including, e.g., methyl, ethyl, propyl, isopropyl, n^butyl, 
t-butyl, neopentyl and the like, 

15 The term “alkyl” refers to alkyl groups that do not contain heteroatoms. Thus the term 

includes strai^t chain alkyl groups such as methyl, ethyl, propyl, butyl, pentyl, hexyl, heptyl, 
octyl, nonyl,decyl,undecyl,dodecyl and the like. The phrase also includes branched chain 
isomers of straight chain alkyl groups, including but not limited to, the following that are 
provided by way of example: -CH(CH 3 ) 2 , -CHCCHsXCJIzCHs), -CH(CH 2 CH 3 ) 2 , - 
:0 C(CH3)3, -C(CH2CH3)3. -CH2CH(CH3)2, -CH 2 CH(CH 3 )(CH 2 CH 3 ), - 

CH2CH(CH2CH3)2. -CH2C(CH3)3, -CH 2 C(CH 2 CH 3 ) 3 , - 

CH(CH3)CH(CH3)(CH2CH3), -CH2CH2CH(CH3)2, -CH 2 CH 2 CH(CH 3 )(CH 2 CH 3 ), - 
CH2CH2CH(CH2CH3)2. -CH2CH2C(CH3)3, -CH 2 CH 2 C(CH 2 CH 3 ) 3 , - 
CH(CH3)CH2CH(CH3)2, -CTI(CH3)CH(CH3)CH(CH3)2, - 
.5 CH(CH 2 CH 3 )CH(CH 3 )CH(CH 3 )(CH 2 CH 3 ), and others. The term also includes cyclic alkyl 

groups such as cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, and cyclooctyl and 
such rings substituted with straight and branched chain alkyl groups as defined above. The term 
also includes polycyclic alkyl groups such as, but not limited to, adamantyl norbomyl, and 
bicyclo[ 2 . 2 . 2 ]octyl and such rings substimted with straight and branched chain alkyl groups as 
0 defined above. Thus, the phrase unsubstituted alkyl groups includes piimaiy alkyl groups, 
secondary alkyl groups, and tertiary alkyl groups. Unsubstituted aliyl groups may be bonded to 
one or more carbon atom(s), oxygen atom(s), nitrogen atom(s), and/or sulfur atom(s) in the 
parait compound. Preferred unsubstituted alkyl groups include straight and branched chain alkyl 
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groups and cycUc alkyl groups having 1 to 20 carbon atoms. More preferred such unsubstituted 
alkyl groups have from 1 to 10 carbon atoms while even more preferred such groups have fiom 1 
to 5 carbon atoms. Most preferred unsubstituted alkyl groups include straight and branched 
chain alkyl groups having from 1 to 3 carbon atoms and include methyl, ethyl, propyl, and - 
5 CH(CH3)2. 

The phrase “substituted alkyl” refers to an unsubstituted alkyl groiq) as defined above in 
which one or more bonds to a carbon(s) or hydrogen(s) are replaced by a bond to non-hydrogen 
and non-carbon atoms such as, but not limited to, a halogen atom in halides such as F, Cl, Br, 
and I; a phosphorus atom in groups such as phosphate and dialkyl alkylphosphonate; oxygen 
10 atom in groups such as hydroxyl groups, alkoxy groups, aryloxy groups, and ester groups; a 

sulfur atom in groups such as thiol groups, alkyl and aryl sulfide groups, sulfone groups, sulfonyl 
groins, and sulfoxide groups; a nitrogen atom in groups such as amines, amides, alkylamines, 
dialkylamines, arylamines, alkylarylamines, diarylamines, N-oxides, imides, and enamines; a 
silicon atom in groups such as in trialkylsilyl groups, dialkylarylsilyl groups, alkyldiarylsilyl 
15 groups, and triarylsilyl groups; and other heteroatoms in various other groups. Substituted alkyl 

groups also include groups in which one or more bonds to a carbon(s) or hydrogen(s) atom is 
replaced by a bond to a heteroatom such as oxygen in carbonyl, carboxyl, and ester groups; 
nitrogen in groups such as imines, oximes, hydrazones, and nitriles. Preferred substituted alkyl ‘ 
groups include, among others, alkyl groups in which one or more bonds to a carbon or hydrogen 
« atom is/aie replaced by one or more bonds to fluorine atoms. One example of a substituted alkyl : 
group is the trifluoromethyl group and other alkyl groups that contain the tiifluoromethyl group. 
Other alkyl groups include those in which one or more bonds to a carbon or hydrogen atom is 
replaced by a bond to an oxygen atom such that the substituted alkyl group contains a hydroxyl, 
alkoxy, aryloxy group, or heterocyclyloxy group. Still other alkyl groups include alkyl groups 
.5 that have an amine, alkylamine, dialkylamine, arylamine, (alkyl)(aryl)ataine,diarylamine, 
heterocyclylamine, (alkyl)(heterocyclyl)amine, (aryl)(hetox)cyclyl)amine, or 
diheterocyclylamine group. 

The term "alkoxy" refers to RO- wh^ein R, for example, is alkyl such as loweralkyl 
defined above. Representative exan^les of loweralkyl alkoxy groups include methoxy, ethoxy, 

0 t-butoxy and the like. 

The phrase “substituted alkoxy” refers to RO-, where R is, for example, an alkyl 
substituted, for example, with a halogen. RO is for example OCF 3 . 
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The tom "alkenyl" refers to a branched or straigjit chain groups comprising two to 
twenty carbon atoms that also comprises one or more carbon-carbon double bonds. 
Representative alkenyl groups include prenyl, 2-propenyl (i.e., allyl), 3-methyl-2-butenyl, 
3,7-dimethyl-2,6-octadienyl, 4,8-dimethyl-3,7-nonadienyl, 3,7,1 l-trimethyl-2,6,10-dodecatrienyl 
5 and the Uke. 

The phrase “substituted alkenyl” refers to alkenyl groups that are substituted, for 
example, diethyl hex-5-enylphosponate, and others with an alkyl or substituted alkyl group such 
as dialkyl phosphate or an esto: such as an acetate ester. 

•' The phrase “dialkyl amino” refers to an amino group substituted with two alkyl groups 
10 such as Cl-20 alkyl groups. 

The phrase “substituted dialkyl amino” refers to a dialkylamino substituted, for example, 
with a carboxylic acid, ester, hydroxy or alkoxy. . . 

The term “hydroxyalkylthio” refers to a thio radical to which is appended a hydroxyalkyl 
group, where the alkyl is for example lower alkyl. An example is hydroxyethylthio, - 
15 SCH 2 CH 2 OH. i 

The term “N-alkylsulfonamide” refers to the group -S02NHalkyl, where alkyl is, for 
example, octyl. 

The tom "alkynyl" refers to a branched or straight chain comprising two to twenty 
carbon atoms that also comprises one or more carbon-carbon triple bonds. Representative 
iO alkynyl groups include ethynyl, 2-propynyl (propargyl), 1-propynyl and the like. 

The term “aryl” refers to aryl groups that do not contain heteroatoms. Thus the term 
includes, but is not limited to, groups such as phenyl, biphenyl, anthracenyl, naphthenyl by way 
of example. Although the phrase “unsubstituted aryl” includes groups containing condensed 
rings such as naphthalene, it does not include aryl groups that have other groups such as alkyl or 
15 halo groups bonded to one of the ring members, as aryl groups such as tolyl are considered 
herein to be substimted aryl groups as described below. A preferred unsubstituted aryl group is 
phenyl. Unsubstituted aryl groups may be bonded to one or more carbon atom(s), oxygen 
atom(s), nitrogen atom(s), and/or sulfur atom(s) in the parent compound, however. 

The phrase “substituted aryl group” has the same meaning with respect to aryl groups that 
10 substituted alkyl groups had with respect to alkyl groups. However, a substituted aryl group also 
includes aryl groups in which one of the aromatic carbons is bonded to one of the non-carbon or 
non-hydrogen atoms described above and also includes aryl groups in which one or more 
aromatic carbons of the aryl group is bonded to a substituted and/or unsubstituted alkyl, alkenyl. 
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or alkynyl group as defined herein. This includes bonding airangemMits in which two carbon 
atoms of an aryl group are bonded to two atoms of an alkyl, alkenyl, or alkynyl group to define a 
fused ring system (e.g. dihydronaphthyl or tetrahydronaphthyl). Thus, the phrase “substituted 
aryl” includes, but is not limited to tolyl, and hydroxyphenyl among others. 

5 The term arylalkyl refers to a loweralkyl radical to which is ^pended an aryl group. 

Representative arylalkyl groups include benzyl, phenylethyl, hydroxybenzyl, fluorobenzyl, 
fluorophenylethyl and the like. 

The phrase “unfiised arylaryl” refers to a group or substituent to which two aryl groups, 
that are not condensed to each other, are bound. Exemplary unfused arylaryl compounds include, 
10 for example, phenylbenzene, diphenyldiazene, 4-methylthio-l-phenylbenzene, phenoxybenzene, 
(2-phenylethynyl)benzene, diphenyl ketone, (4-phenylbuta-l,3-diynyl)benzene, 
phenylbenzyiamine, (phenylmethoxy)benzene, and the like. Preferred substituted unfused 
arylaryl groups include: 2-(phenylamino)-N-[4-<2-phenylethynyl)phenyl]acetamide, 1 , 4 - 
diphenylbenzene, N-[4-(2-phenylethynyl)phenyl]-2-[beEizylamino]acetamide, 2-amino-N-[4-(2- 
5 phenylethynyl)phenyl]propanamide, 2-amino-N-[4-(2-phenylethynyl)phenyl]acetamide, 2 - 
(cyclopropylamino)-N-[4-(2-phenylethynyi)phenyl]acetamide, 2-(ethylamino)-N-[4-(2- 
phenylethynyl)phenyl]acetamide,2-[(2-methylpropyl)ainino]-N-[4-(2- 
phenylethynyl)phenyl]acetamide,5-phenyl-2H-benzo[d]l,3-dioxolene,2-chloro-l-niethoxy-4- . 
phenylbenzene, 2-[(iimdazolylmethyl)amino]-N-[4-(2-phenylethynyl)phenyl]acetaraide, 4- 
0 phenyl-l-phenoxybaizene, N-(2-aminoethyl)[4-(2-phenylethynyl)phenyl]carboxamide, 2-{[(4- . ■ 
fluorophenyl)methyl]amino}-N-[4-(2-phenylethynyl)phenyl]acetamide,^{[(4- 
methylphenyl)methyl]amino}-N-[4-(2-phenylethynyl)phenyl]acetamide,4-phenyl-l- 

(trifluoromethyl)benzene, l-butyl-4-phenylbenzene, 2-(cyclohexylamino)-N-[4-(2- 
phenylethynyl)phenyl] acetamide, 2-(ethylmethylamino)-N-[4-(2- 
5 phenylethynyl)phenyl]acetamide, 2-(butylamino)-N-[4-(2-phenylethynyl)phenyl]acetamide, N- 
[4-(2-phenylethynyl)phenyl]-2-(4-pyridylamino)acetamide,N-[4-(2-phenylethynyl)phenyl]-2- 
(quinuclidin-3-ylamino)acetamide,N-[4-(2-phenylethynyl)phenyl]pynDHdin-2-ylcarboxamide, 
2-amino-3-methyl-N-[4-(2-phenylethynyl)phenyl]butanamide,4-(4-phenylbuta-l,3- 
diynyl)phenylamine,2-(dimethylamino)-N-[4-(4-phenylbuta-l,3-diynyl)phenyl]acetamide,2- 
) (ethylamino)-N-[4-<4-phenylbuta-13-diynyl)phenyl]acetamide, 4-ethyl-l -phenylbenzene, l-[ 4 - 
(2-phenylethynyl)phenyl]ethan-l-one, N-(l-carbamoyl-2-hydroxypropyl)[4-(4-phenylbuta-l,3- 
diynyl)phenyl]carboxamide,N-[4-(2-phenylethynyl)phenyl]propanamide,4-methoxyphenyl 

phenyl ketone, phenyl-N-benzamide, (tert-butoxy)-N-[(4-phenylphenyl)methyl]carboxamide, 2 - 
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(3-phenylphenoxy)ethanehydroxaniic add, S-phenylphrayl piopanoate, l-(4-ethoxyphenyl)-4- 
methoxybenzene, and [4-(2-phMiylethynyl)phenyl]pytrole. 

Tbe phrase “unfused heteroarylaryl” refers to a unfiised arylaryl group where one of the 
aryl groups is a heteroaryl group. Exemplary heteroarylaryl groups indude, for example, 2- 
5 phenylpyridine, phenylpyrrole, 3-(2-phenylethynyl)pyridine, phenylpyrazole, 5-(2- 
phenyIethynyl)-l,3-dihydropyriimdine-2,4-dione, 4-phenyl-l,2,3-thiadiazole, 2-(2- 
phenylethynyl)pyrazine, 2-phdiyltfaiophene, phenylimidazole, 3-(2-piperazinylphenyl)furan, 3- 
(2,4-dichlorophenyl)-4-methylpynole, and the like. Preferred substituted unfused heteroarylaryl 
groups indude: 5-(2-phenylethynyl)pyrmiidine-2-ylaiiune, l-metfaoxy-4-(2-thienyl)benzene, 1- 
10 methoxy-3-(2-thienyl)benzene, 5-methyl-2-phenylpyridine, 5-methyl-3-phenylisoxazole, 2-(3- 
(tiifluoromethyl)phenyl]furan, 3-fluoro-5-(2-furyl)-2-methoxy-l-prop-2-enylbenzene, 
(hydroxyimino)(5-phenyl(2-thienyl))methane, 5-[(4-methylpiperazinyl)methyl]-2- 
phenylthiophene, 2-(4-ethylphenyl)thiophene, 4-methylthio-l-(2-thienyl)benzene, 2-(3t 
nitrophenyl)thiophene, (tert-butoxy>N-[(5-phenyl(3-pyridyl))methyl]carboxamide, hydroxy-N- 
15 [(5-phenyl(3-pyridyl))methyl]amide, 2-(phenylmethylthio)pyridine. and benzylimidazole. 

The phrase “unfused heteroarylheteroaryl” refers to an unfused arylaryl group where both 
of the aryl groups is a heteroaryl group. Exemplary heteroarylheteroaryl groups include, for 
example, 3-pyridylimidazole, 2-imidazolylpyrazine, and the like. Preferred substituted unfused 
heteroarylheteroaryl groups include: 2-(4-piperazinyl-3-pyridyl)furan, diethyl(3-pyrazin-2-yl(4- 
10 pyridyl))amine,anddimethyl{2-[2-(5-methylpyrazin-2-yl)ethynyl](4-pyridyl))amine. 

The phrase “fused arylaryl" refers to an aryl group as previously HftfinfiH that is 
condensed, and fully conjugated to an aryl group. Representative fused arylaryl groups include 
biphenyl, 4-(l-naphthyl)phenyl, 4-(2-naphthyl)phenyl and the like. . 

The phrase “fused heteroarylaryl" refers to an aryl group as previously defined that is 
15 condensed, and fully conjugated with a heteroaryl group. Representative fused heteroarylaryl 
groups include quinoline, quinazoline and the like. 

Hie phrase “fused heteroarylheteroaryl" refers to a heteroaryl group as previously defined 
that is condensed, and fully conjugated with another heteroaryl group. R^resentative fused 
heteroarylheteroaryl groups include pyrazalopyrimidine, imidazoquinoline and the like. 

10 The term "aryloxy" refers to RO- wherein R is an aryl group. Representative arylalkoxy 

group include benzyloxy, phenylethoxy and the like. 

Hie term "arylalkoxy" refers to a lower alkoxy radical to which is appended an aryl 
group. Representative arylalkoxy group include benzyloxy, phenylethoxy and the like. 
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The tom "aryloxyaryr refers to an aryl radical to which is appended an aryloxy group. 
Representative aryloxyaryl groups include 4-phenoxyphraiyl, 3-phenoxyphenyl, 
4-phenoxy-l-naphthyl, 3-phenoxy-l-naphthyl and the like. 

The term "aryloxyarylalkyl" refers to an arylalkyl radical to which is appended an 
5 aryloxy group. Representative aryloxyarylalkyl groups include 4-phenoxyphenylmethyl, 
3-phenoxyphenylmethyl, 4-phenoxyphenylethyl, 3-phenoxy-phenylethyl and the like. 

The term "arylalkoxyaryl" refers to an aryl radical to which is appended an arylalkoxy 
group. Representative arylalkoxyaryl groups include 4-benzyloxylphenyl. 3-benzyloxyphenyl 
and the like. 

P The term "arylalkoxyarylalkyl" refers to an arylalkyl radical to which is appended an 

arylalkoxy group. Representative arylalkoxyarylalkyl groups include 4-benzyloxylbenzyl, 
3-benzyloxybenzyl and the like. 

The term "cycloalkyl" refers to an alicyclic group comprising jhom 3 to 7 carbon atoms 
mcluding, but not limited to, cyclopropyi, cyclobutyl, cyclopentyl, cyclohexyl and the like. 

5 The term "cycloalkylalkyl" refers to a loweralkyl radical to which is appended a 

cycloalkyl group. Representative examples of cycloalkylalkyl include cyclopropylmethyl, 
cyclohexylmethyl, 2-(cyclopropyl)ethyI and the like. 

The term "halogen" refers to iodine, bromine, chlorine or fluorine; "halo" refers to iodo, 
bromo, chloro or fluoro. 

3 The tom "haloalkyl" refers to a lower alkyl radical, as defined above, bearing at least one 

halogen substituent, for example, chloromethyl, fluoroethyl or trifluoromethyl and the like. 

The term “heterocyclyr (or heterocyclic, or heterocyclo) refers to both aromatic and 

nonaromatic ring compounds including monocychc, bicycHc, and polycycUc ring compounds 
such as, but not limited to, quinuclidyl, containing 3 or more ring members of which one or more 
) is a heteroatom such as, but not limited to, N, O, and S. Although the phrase “unsubstituted 
heterocyclyl” includes condensed heterocyclic rings such as benzimidazolyl, it does not include 
heterocyclyl groups that have other groups such as alkyl or halo groups bonded to one of the ring 
members as compounds such as 2-methylbenzimidazolyl are substituted heterocyclyl groups. 
Examples of heterocyclyl groups include, but are not limited to: nns atura t ed 3 to 8 membered 
' nngs containing 1 to 4 nitrogen atoms such as, but not limited to pyrrolyl, pytrolinyl, imidazolyl, 
pyrazolyl, pyridyl, dihydropyridyl, pyrimidyl, pyrazinyl, pyridazinyl, triazolyl (e.g. 4H-1,2,4- 
triazolyl, lH-l,2,3-triazolyl, 2H-l,2,3-ttiazolyl etc.), tetrazolyl, (e.g. IH-tetrazolyl, 2H 
tetrazolyl, etc.); saturated 3 to 8 membered rings containing 1 to 4 nitrogen atoms such as, but 
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not limited to, pyrrolidinyl, imidazolidin^, pipeiidinyl, pipanzinyl; condensed 
heterocyclic groups containing 1 to 4 nitrogen atoms such as, but not limited to, indolyl, 
isoindolyl, indolinyl, indolizinyl, ben2amidazolyl, quinolyl, isoquinolyl, indazolyl, 
benzotriazolyl; unsaturated 3 to 8 membered rings containing 1 to 2 oxygen arorng and 1 to 3 
5 nitrogen atoms such as, but not limited to, oxazolyl, isoxazolyl, oxadiazolyl (e.g. 1,2,4- 
oxadiazolyl, 1,3,4-oxadiazolyl, 1,2,5-oxadiazolyI, etc.); saturated 3 to 8 membered rings 
containing 1 to 2 oxygen atoms and 1 to 3 nitrogen atoms such as, but not limited to, 
morpholinyl; unsaturated condensed heterocyclic groups containing 1 to 2 oxygen atmng and 1 to 
3 nitrogen atoms, for example, benzoxazolyl, benzoxadiazolyl, benzoxazinyl (e.g. 2H-1,4- 
10 benzoxazinyl etc.); unsaturated 3 to 8 membered rings containing 1 to 3 sulfur atoms and 1 to 3 
nitrogen atoms such as, but not limited to, thiazolyl, isothiazolyl, thiadiazolyl (e.g. 1,2,3- 
thiadiazolyl, lA4-thiadiazolyl, 1,3,4-thiadiazolyl, 1,23-thiadiazolyl, etc.); saturated 3 to 8 
membered rings containing 1 to 2 sulfur atoms and 1 to 3 nitrogen atoms such as, but not limited 
to, thiazolodinyl; saturated and unsaturated 3 to 8 membered rings containing 1 to 2 sulfur ato ms 
15 such as, but not limited to, thienyl, dihydrodithiinyl, dihydrodithionyl, tetrahydrothiophene, 

tetrahydrothiopyran; unsaturated condensed heterocyclic rings containing 1 to 2 sulfur atoms and 
1 to 3 nitrogen atoms such as, but not limited to, benzothiazolyl, benzothiadiazolyl, 
benzothiazinyl (e.g. 2H-l,4-benzothiazinyl, etc.), dihydrobenzothiazinyl (e.g. 2H-3,4- 
dihydrobenzotbiazinyl, etc.), unsaturated 3 to 8 membered rings containing oxygen atoms such 
10 as, but not limited to furyl; unsaturated condensed heterocyclic rings con taining 1 to 2 oxygen 
atoms such as benzodioxolyl (e.g. 1,3-benzodioxoyl, etc.); unsaturated 3 to 8 membered tings 
containing an oxygen atom and 1 to 2 sulfur atoms such as, but not limited to, dihydrooxathiinyl; 
saturated 3 to 8 membered rings containing 1 to 2 oxygen atoms and 1 to 2 sulfur atoms such as 
1,4-oxathiane; unsaturated condensed rings containing 1 to 2 sulfur atoms such as benzothienyl, 

15 benzodithiinyl; and unsaturated condensed heterocyclic tings containing an oxygen atom and 1 

to 2 oxygen atoms such as benzoxathiinyl. Hetraocyclyl group also include those described 
above in which one or more S atoms in the ring is double-bonded to one or two oxygen atoms 
(sulfoxides and sulfones). For example, heterocyclyl groups include tetrahydrothiophene, 
tetrahydrothiophene oxide, and tetrahydrothiophene 1,1-dioxide. Preferred heterocyclyl groups 
10 contain 5 or 6 ring members. More preferred heterocyclyl groups include morpholine, 

piperazine, piperidine, pyrroUdine, imidazole, pyrazole, 1,23-triazole, 13,4-triazole, tetrazole, 
thiomorpholine, thiomorpholine in which the S atom of the thiomorpholine is bonded to one or 
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more O atoms, pyrrole, homopiperazine, oxazolidin-2-one, pyrrolidin-2-one, oxazole, 
quinuclidine, thiazole, isoxazole, furan, and tetrahydrofiiran. 

ITie phrase “substituted heterocyclyl” refers to an heterocyclyl group as defined above in 
which one of the ring memb^ is bonded to a non-hydrogen atom such as described above with 
5 respect to substituted alkyl groups and substituted aryl groups. Examples, include, but are not 

limited to, 2-methylbenzimidazolyl, 5-methylbenzimidazolyl, S-chlorobenzthiazolyl, 1-methyl 
piperazinyl, and 2-chloropyridyl among others. 

"Aminosulfonyl" refers to the group -S(0)2-NH2. "Substituted aminosulfonyl" refersto 
• the group S(0) 2 -NRR' where R is loweraUcyl and R* is hydrogen or a loweralkyl. The term 
10 "aralkylaminosulfonlyaiyl" refers to the group -aryl-S(O) 2 -^ffl[-aralkyl, where the aralkyl is 
loweraralkyl. 

"Carbonyl" refers to the divalent group -C(0)-. 

"Carbonyloxy" refers generally to the group -C(0)-0-,. Such groups include 
esters; -C(0)-0-R, where R is loweralkyl, cycloalkyl, aryl, or loweraralkyl. The term 
15 "carbonyloxycycloalkyl" refers ^nerally to both an "carbonyloxycarbocycloalkyl" and an 
"carbonyloxyheterocycloalkyl", i.e., where R is a caibocycloalkyl or heteaocycloalkyl,- ' 
respectively. The term "arylcarbonyloxy" refers to the group -C(0)-0-aryl, where aryl is a 
mono- or polycyclic, carbocycloaryl or heterocycloaryl. The term "aralkylcarbonyloxy" refers to 
the group -C(0)-0-aralkyl, where the aralkyl is loweraralkyl. 

20 , The term "sulfonyl" refers to the group -SO 2 -. "Alkylsulfonyl" refers to a substituted 

sulfonyl of the stracture -SO 2 R - in which R is alkyl. Alkylsulfonyl groups employed in 
compounds of the present invention are typicaUy loweralkylsulfonyl groups having from 1 to 6 • 
carbon atoms in its backbone stmcture. Thusj typical alkylsulfonyl groups employed in 
compounds of the present invention include, for example, metiiylsulfonyl (i.e., where R is 
25 methyl), ethylsulfonyl (i.e., where R is ethyl), propylsulfonyl (i.e., where R is propyl), and the 
like. The term "arylsulfonyl" refersto the group -S02-aryl. The term "aralkylsulfonyl" refers to 
, the group -S02-aralkyl, in which the aralkyl is loweraralkyl. The term "sulfonamido" refers to - 
SO 2 NH 2 . 

The term "carbonylamino" refers to the divalent group -NH-C(0)- in which the hydrogen 
30 atom of the amide nitrogen of the carbonylamino group can be replaced a loweralkyl, aryl, or 
loweraralkyl group. Such groups include moieties such as carbamate esters (-NH-C(O)-O-R) and 
amides -NH-C(0)-0-R, where R is a straight or branched chain loweralkyl, cycloalkyl, or aryl 
or loweraralkyl. The term "loweralkylcarbonylamino" refers to alkylcarbonylamino where R is a 
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loweralkyl having firom 1 to about 6 caibon atoms in its bacldmne stmctuie. Tlie tenn 
"arylcarbonylamino" refers to group ^NH-C(0)-R where R is an aryl. Similarly, the term 
"aralkylcaibonylamino " refers to carbonylamino where R is a lower aralkyl. 

The term "guanidino" or "guanidyl" refers to moieties derived from gimniHinft 
H2N-C(=NH>NH2- Such moieties include those bonded at the nitro^n atom carrying the formal 
double bond (the "2"-position of die guanidine, e.g., diaminomethyleneamino, (H2N)2C=NH-) 
and those bonded at either of the nitrogen atoms carrying a formal single bond (the " 1-" and/or 



"3"-positions of the guandine, e.g., H 2 N-C(=NH)-NH-). The hydrogen atoms at any of the 
nitro^ns can be replaced with a suitable substituent, such as loweralkyl, aryl, or loweraralkyl. 

Representative cycloimido and heterocycloimido groups include, for example, those 
shown below. These cycloimido and heterocycloimido can be further substituted and may be 
attached at various positions as will be apparent to those having sIHll in the organic and 
medicinal chemistry arts in conjunction with the disclosure herein. 




O 




Representative substituted amidino and heterocycloamidino groups include, for example, 
10 those shown below. These amidino and heterocycloamidino groups can be further substituted as 
will be apparent to those having skill in the organic and medicinal ch emis try arts in conjunction 
with the disclosure herein. 
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Representative substituted alkylcaibonylamino, alkyloxycarbonylamino, 
arninoalkyloxycatbonylamino, and aiylcaibonylamino groups include, for example, those shown 
5 below. Hiese groups can be further substituted as will be apparent to those having sIHll in the 
organic and medicinal chemistry arts in conjunction with the disclosure herein. 




10 Representative substituted anoinocarbonyl groups include, for example, those shown 

below. These can heterocyclo groups be further substituted as will be apparent to those having 
skill in the organic and medicinal chemistry arts in conjunction with the disclosure herein. 




Representative substituted alkoxycarbonyl groups include, for example, those shown 
below. These alkoxycarbonyl groups can be further substituted as will be qjparent to those 
having skill in the organic and medicinal chemistry arts in conjunction witii the disclosure 
herein. 
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V 



OH ("^O 





5 “Substituted” refers to the definite replacement of hydrogen with one or more 

monovalent or divalent radicals. Suitable substitution groups include, those described herein for 
particular groups, as wefi as hydroxyl, nitro, amino, imino, cyano, halo, thio, thioamido,. 
amidino, imidino, oxo, oxamidino, methoxamidino, imidino, guanidino, sulfonamido, carboxyl, 
formyl, alkyl, substituted alkyl, halolow^alkyl, loweralkoxy, haloloweralkoxy, loweralkoxy- 
10 alkyl, alkylcarbonyl, arylcarbonyl, aralkylcarbonyl, heteroarylcaibonyl, heteroaralkylcarbonyl, . 

alkylthio, aminoalkyl, cyanoalkyl, benzyl, pyridyl, pyiazolyl, pyrrole, thiophene, imidazolyl, and 
the like. 

The term “linking moiety” refers to a covalent bond or an uncyclized divalent group, 
such as, for example, -CO-, -0-, -S-, -CH 2 -, -NH-, and substituted or unsubstituted alkyl, 

15 alkenyl, alkynyl, carbonyl, alkoxycaibonyl groups as defined herein. 

# 

The term “SMIP compoimd” refers to small molecule immunopotentiating compounds, 

’ that include small molecule compounds below about MW 1000 g/mol, preferably MW 800 
g/mol that are capable of stimulating or modulating a pro-inflammatory response in a patient. In 
an embodiment, the SMIP compounds arc able to stimulate human peripheral blood mononuclear 
10 cells to produce cytokines. Preferred SMIP compounds and derivatives thereof include, for 
example, aminoazavinyl compounds, benzazole compotmds, acylpiperazine compounds; 
indoledione compounds, tetrahydroisoquinoline (THIQ) compounds, anthraquinone compounds, 
indanedione compounds, pthalimide compounds, benzocyclodione compounds, 
aminob en zimidazole quinolinone (ABIQ) compounds, hydraphthalimide compounds, 

15 pyrazolopyrimidine compounds, quinazilinone compounds, quinoxaline compounds, triazine 
compounds, tetrahydropynolidinoquinoxaline conopounds, pyrrole compounds, benzophenone 
conq>ounds, sterol compound, and isoxazole compounds. 
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The term “SMIS compound” refers to small molecule immunosuppressant compounds, 
that include small molecule compounds below about about MW 1000 g/mol, preferably MW 800 
g/mol, capable of suppressing or modulating a pro-inflammatory response in a patient 



5 



10 



.5 



Acylpiperazine compounds as described throughout this application include compounds 
of formula (HI) as shown below: 




D4 



ni 

wherein, 

. R9 is selietrted from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
arylalkyl, arylalkenyl, heteroarylalkyl, and heteroarylalkenyl; 

, Rio is substituted or unsubstituted alkyl; 
n is an iiiteger from 0-2; and. 



if Di is carbon than D 2 is oxygen, D 3 is absent and D 4 is selected from the group consisting 



of substituted or unsubstituted aryl, heteroaryl, carbocycyl, alkoxyaryl, fused arylaryl, fused 
arylheteroaryl, and fused heteroarylaryl; or. 



if Di is. nitrogen than D 2 is nitrogen, D 4 is absent and D 3 is selected from the group 
consisting of substituted or unsubstituted aryl, heteroaryl, carbocycyl, alkoxyaryl, fused 
arylaryli fused arylheteroaryl, and fused heteroarylaryl, 

« 

Ihdoledione compounds as described throughout this application include compounds of 
formula (IV) as shown below: 



IV 

5 wherein. 
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Rii and Rn are indepradently selected firom the group consisting of H, nitro, halogen, 
amino, hydroxy, cyano, carboxcyclic add, and substituted or unsubstituted alkyl, aryl, 
heteroaiyl, alkoxy, alkylcarbonyl, alkylcaibonylamino, alkylaminocarbonyl, 
aminocaibonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoallqrl, heterocyclyl, heterocyclylalkoxy, 
heteiocyclylalkyl, andcarbocyclyl groups; and, 

Ri 3 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
aiylalkyl, heteroarylalkyl, heterocyclyl, heterocyclylalkyl, and alkylbenzyl. 



10 



5 



0 



5 



Tetrahydroisoquinoline (THIQ) conq)ounds as described throughout this application 
include compounds of formula (V) as shown below: 




wherein, 

L is a covalent bond or selected from the group consisting of -CH2-, -CO-, -0-, -S-, CHF, 
-NH-, -NR2(r, where Rjo is lower alkyl; 

Ri 4 is selected from the group consisting of hydrogen, halogen, and substituted or 
unsubstituted alkyl; 

Ri 5 is selected from the group consisting of substituted or unsubstituted carbocyclyl, aryl, 
arylalkyl, alkoxyaryl, heteroaryl, heterocyclyl; 

Ri 6 is selected from the group consisting of hydrogen, halogen, and substituted or 
, unsubstituted alkyl; 

Ri 7 is selected from the group consisting of hydrogen, halogen, and substituted or 
unsubstituted alkyl; and, 

Ri 8 and R19 are independently selected from the group consisting of H, hydroxy, halogen, 
alkoxy, amino, unsubstituted alkyl, substituted alkyl, and alkylamino. 



Benzocyclodione compounds as described throughout this application include 
) compounds of formula (VI) as shown below: 
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VI 



wherein, 

E is selected from the group consisting of NRas or CR26Rz7; 

R2i> R23> 2nd R24 are independently selected from the group co nsis ting of H, hydroxy, 
halogen, alkoxy, amino, unsubstituted alkyl, substituted alkyl, and alkylamino; 

R22 is selected from the group consisting or H, hydroxy, halogen, alkoxy, amin o, and 
unsubstituted or substituted alkyl, and alkylamino, arylalkyl, heteroarylalkyl, aryl, 
heteroaryl, arylcarbonyl, heterocyclyl, heterocyclylalkyl, and heteroarylcarbonyl; 

R25 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
heterocyclyl, carbocyclyl, arylalkyl, heteroarylalkyl, and heterocyclyalkyl; 

R26 is selected from the group consisting of H, halogen, hydroxy, amin o, and substituted 
or unsubstituted alkyl, carbonylalkyl, and alkylcarbonylalkyl; and, 

R27 is selected from the group aryl, arylalkyl, heteroarylalkyl, heterocyclyl, 
heterocyclylalkyl, carbocyclyl, arylcarbonylalkyl, and arylalkylcarbonyl. 




ZO 



15 



Aminoazavinyl compounds as described throughout this application include compounds 
of formula (VH) as shown below: ’ • 



^ 28 ^ 




vn 



wherein, 

GisdtherSorNH; 

R28 is selected from the group consisting of H, and substituted or unsubstituted alkyl, 
aryl, heteroaryl, heteroarylalkyl, arylalkyl, carbocyclyl, carbocyclylalkyl, heterocyclyl, 
and heterocyclylalkyl; 

Q is selected from the group consisting of hydrogen, substituted alkyl, unsubstituted alkyl. 



and aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclyl, substituted 
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heterocyclyl, fused or unfused arylaryl, substituted arylaryl, arylhetrao^l, substituted 
arylheteroaryl, heteroarylheteroaryl, and substituted heteroar^eteroaryl; 

Vi is selected from the group consisting of all^l, substituted alkyl, aryl, substituted aryl, 
arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, heteroarylalkyl, 

5 substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, alkoxycarbonyl, 

carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted alkylcarbonyl, 
arylcarbonyl, aralkylcatbonyl, heteroarylcarbonyl, heteroaralkylcarbonyl, 
alkylcarbonyloxy, arylcarbonyloxy, aralkylcarbonyloxy, heteroarylcarbonyloxy, 
heteroaralkylcarbonyloxy, alkylaminocaibonyloxy, arylaminocarbonyloxy, formyl, 

10 loweralkylcarbonyl, loweralkoxycarbohyl, aminocarbonyl, aminoaryl, aUcylsulfonyl, 

suLfonamido, aminoalkoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 

♦ I * . . 

alkylaininocarbonylainmo, arylaminocatbonylamino, araUcylcarbonylamino, 
heteroarylcaibonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloimido, 
arylsulfonyl and arylsulfonamido; and, 

15 V 2 is selected from the group consisting of hydrodgen, halogen, alkyl, substituted alkyl, 

aryl, substituted aryl, arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, 
heteroarylalkyl, substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, 
alkoxycarbonyl, carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted 
alkylcarbonyl, arylcarbonyl, aralkylcatbonyl, heteroarylcarbonyl, heteroaralkylcarbonyl, 
50 alkylcarbonyloxy, arylcarbonyloxy, aralkylcarbonyloxy, heteroarylcarbonyloxy, ' 

heteroaralkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, formyl, 
loweralkylcarbonyl, loweralkoxycarbonyl, aminocarbonyl, aminoaryl, aUcylsulfonyl, 
sulfonamido, aminoalkoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 
allQ’laminocarbonylamino, arylaminocarbonylamino, aralkylcarbonylamino, 

55 heteroarylcarbonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloimido, 

arylsulfonyl and arylsulfonamido. 

Lactam compounds as described throughout this application include compounrk of 
formula (Vni) as shown below: 
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vm 



wherein, 

Wi is selected from the group consisting of -OH, -OR36 groups, -NR37R38; 

W2 is selected from the group consisting of O, S, and NR39 groups; 

R29 and R30 join to form a 5 to 6 membered substituted or unsubstituted ring comprising 
all carbon atoms or at least one O, N, or S atom; 



R35 and R39 may be the same or different and are selected from die group consisting of H, 
-OH substituted and unsubstituted alkyl groups, substituted and linsubstituted aryl 
groups, -C(=0)H, -C(=0)-alkyl groups, and -C(=0)-aryl groups; 

Raij R32, R33J and R34 may be the same or different and are independently selected from 
the group consisting of H, Cl, Br, F, I, -NO2, -CN, -OH, -OR40 groups, -NR41R42 groups, 
-C(=0)R43 groups, -SH groups, substituted and unsubstituted amidinyl groups, 
substituted and unsubstituted guanidinyl groups, substituted and unsubstituted alkyl 
groups, substituted and unsubstituted aryl groups, substituted and unsubstituted alkenyl 
groups, substituted and unsubstituted alkynyl groups, substituted and unsubstituted 
heterocyclyl groups, substituted and unsubstituted alkylaminoalkyl groups, substituted 
and unsubstituted dialkylaminoalkyl groups, substituted and unsubstituted arylaminoalkyl 
groups, substituted and unsubstituted diarylaminoalkyl groups, substituted and 
unsubstituted (alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted 
heterocyclylalkyl groups, substituted and unsubstituted aminoalkyl groups, substituted 
and unsubstituted heterocyclylaminoalkyl groups, substituted and unsubstituted 
diheterocyclylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted 
(aryl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted hydroxyalkyl 
groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 
aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; 

R36 is selected from the group consisting of substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl 
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groups, substituted and unsubstituted heterocyclylalkyl groups, -C(=0)H, -C(=0)-alkyl 
groups, -C(=0)-aryl groups, -C(=0)0-alkyl groups, -C(=0)0-aryl groups, -C(=0)NH2, - 
C(=0)NH(alkyl) groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, - 
C(=0)N(aryI)2 groups, -C(=0)N(alkyl)(aryl) groups, -NH 2 , -NH(alkyl) groups, -NH(aryI) 
5 groups, -N(alkyl )2 groups, -N(alkyl)(aryl) groups, -N(aryl )2 

groups, -C(=0)NH(heterocyclyl) groups, -C(=0)N(heterocyclyl)2 
groups, -C(=0)N(alkyl)(heterocyclyl) groups, and -C(=0)N(aryl)(heterocyclyl) groups; 
R37 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and imsubstituted aryl groups, and substituted and unsubstituted heteiocyclyl 
10 groups; 

R 38 is selected from the group consisting of H, substituted and imsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, substituted and unsubstituted heteiocyclyl 
groups, -OH, alkoxy groups, aryloxy groups, -NH 2 , substituted and unsubstituted 
heterocyclylalkyl groups, substituted and unsubstituted aminoalkyl groups, substituted 
15 and unsubstituted alkylaminoalkyl groups, substituted and unsubstituted 

dialkylaminoalkyl groups, substituted and unsubstituted arylaminoalkyl groups, 
substitute and unsubstituted diarylaminoalkyl groups, substitute and unsubstitute 
(alkyl)(aryl)aminoalkyl groups, substituted and unsubstitute alkylamino groups, 
substitute and unsubstitute arylamino groups, substitute and unsubstitute 

10 diecylamino groups, substitute and unsubstituted diarylamino groups, substitute and 

unsubstitute (alkyl)(aryl)amino groups, -C(=0)H, -C(=0)-alkyl groups, -C(=0)-aryl 
groups, -C(=0)0-alkyl groups, -C(=0)0-aryl groups, -C(=0)NH2, -C(=0)NH(alkyl) 
groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, -C(=0)N(aryl)2 
groups, -C(=0)N(alkyl)(aryl) groups, -C(=0)-heterocyclyl groups, -C(=0)-0- 
55 heteiocyclyl groups, -C(=0)NH(heterocyclyl) groups, -C(=0)-N(heterocyclyl)2 groups, - 

C(=0)-N(alkyl)(heterocyclyl) groups, -C(=0)-N(aryl)(heterocyclyl) groups, substitute ’ 
and unsubstitute heteiocyclylaminoalkyl groups, substitute and unsubstitute 
diheterocyclylaminoalkyl groups, substitute and unsubstitute 
(alkyl)(heterocyclyl)aminoaIkyl groups, substitute and unsubstitute 
0 (aryl)(heterocyclyl)aminoalkyl groups, substitute and unsubstitute hydroxyalkyl 

groups, substitute and unsubstitute alkoxyalkyl groups, substitute and unsubstitute 
aryloxyalkyl groups, and substitute and unsubsdtute heterocyclyloxyaDcyl groups; 
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R41 is selected fconi tbe group consisting of H, substituted &nd unsubstituted alVyl groups^ 
substituted and unsubstituted aryl groups, and substituted and unsubstituted heterocyclyl 
groups; 

R42 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
5 substituted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl 

groups, -C(=0)H, -C(=0)-alkyl groups, -C(=0)-aryl groups, - 
C(=0)NH2, -C(=0)NH(alkyl) groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, 
-C(=0)N(aryl)2 groups, -C(=0)N(alkyl)(aryl) groups, -C(=O)0-alkyl 
groups, -C(=0)O-aryl groups, substituted and unsubstituted aminoalkyl groups, 

10 substituted and unsubstituted alkylaminoalkyl groups, substituted and unsubstituted 

dialkylaminoalkyl groups, substituted and unsubstituted arylaminoalkyl groups, 
substituted and unsubstituted diarylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted heterocyclylalkyl groups, - 
C(=0)-heterocyclyl groups, -C(=0)-0-heterocyclyl groups, -C(=0)NH(heterocyclyl) 

.5 groins, -C(=0)-N(helerocyclyl)2 groups, -C(=0)-N(alkyl)(heterocyclyl) groups^ -C(=0> 

N(aryl)(heterocyclyl) groups, substituted and unsubstituted heterocyclylaminoalkyl 
groups, substituted and unsubstituted diheterocyclylaminoalkyl groups, substituted and 
unsubstituted (heterocyclyl)(alkyl)aminoalkyl groups, substituted and unsubstituted 
(heterocyclyl)(aryI)aminoalkyl groups, substituted and unsubstituted hydroxyalkyl 
® groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 

aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; and 
R43 is selected from the group consisting of H, -NH 2 , -NH(alkyl) groups, -NH(aryl) 
groups, -N(alkyl )2 grorqrs, -N(aryl )2 groups, -N(alkyl)(aryl) groups, -NH(heterocyclyl) 
groups, -N(heterocyclyl)(alkyl) groups, -N(heterocyclyl)(aryl) groups, -N(heterocyclyl )2 
•5 groups, substituted and unsubstituted alkyl groups, substituted and unsubstituted aryl 

groups, -OH, substituted and unsubstituted aUcoxy groups, substituted and unsubstituted 
heterocyclyl groups, substituted and unsubstituted aryloxy groups, heterocyclyloxy 
groups, -NHOH, -N(alkyl)OH groups, -N(aryl)OH groups, -N(aIkyl)0-alkyl groups, - 
N(aryl)0-alkyl groups, -N(alkyl)0-aryl groups, and -N(aryl)0-aryl groups. 

0 

Preferably R 29 and R30 join together to form a substituted or unsubstituted phenyl ring. 
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10 



15 



Hydropthalamide con^)oimds as described throu^out this application include 
compounds of foimula (DQ as shown below; 




wherein, 

R44 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
arylalkyl, heteroarylalkyl, fused arylaryl, unfused arylaryl, fused heteroarylaryl, unfiised 
heteroarylaryl, fused arylheteroaryl, and unfused arylheteroaryl; 

, R45, R47..R49. and R51 may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
. substituted or unsiAstituted alkyl, aryl, heteroaryl, alkoxy, aUcylcarbonyl, 
alkylcarix)nylamino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, heteroarylalkoxy, 
alkylamino, arylalkylamino, arylammo, heteroarylamino, heteroarylaminoalkyl, 
heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and carbocyclyl; and 
R46, R48>. R-5 o> and Rs2 may be the same or different and are independently selected from 
the group consisting of H, halogen, and substituted or unsubstituted alkyl groups. 



20 



25 



Benzdphenone compounds as described throughout this application include compounds 
of formula (X) as shown below: 




wherein, 

R53 is independently selected from the group consisting of H, nitro, halogen, amino, 
hydroxy, cyano, carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, aUcylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, 
aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl; 
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R54 is independently selected from the group consisting of H, nitro, halogen, amino, 
hydroxy, cyano, caiboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, alkylcarbonyl, alkylcaibonylamino, alkylaminocarbonyl, 
aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
5 heteroaiylamino, heteroarylaminoalkyl, heterocyclyl, heterxjcyclylalkoxy, 

heterocyclylalkyl, and carbocyclyl; and 
o and p are integers from 0-4. 



Isoxazole compounds as described throughout this q>plication include compounds of 
10 formula (XI) as shown below: 




wherein, 

R 55 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 
heteroaryl, heteroarylalkyl, heterocyclyl, and heterocyclylalkyl; 

Rs 6 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 
heteroaryl, heteroarylallqrl, heterocyclyl, and heterocyclylalkyl; and, 

R57 is selected from the group consisting of H, halogen, hydoxy, and substituted or 
unsubstituted aUcyl, aryl, heteroaryl, heterocyclyl, and carbonyl. 

Sterol compounds as described throughout this application include compounds of 
formula (XU) as shown below; 




5 wherein, 

Rs 8 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 
heteroaryl, heteroarylalkyl, heterocyclyl, and heterocycl ylalky l 
Preferably Rsg is a pyranone substituent. 



0 
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Qui n azil in one coiqpounds as described throughout this application include con^wunds of 
fonnula (XlU) as shown below: 




xm 

wherein, 

R59 is selected from the group consisting of H, halogai, hydroxy, and substituted or 
unsubstituted alkyl, aminoalkyl, alklyaminoalkyl, alkoxy, dialkylaminoalkyl. 



hydroxyalkyl, alkenyl, alkynyl, carbocyclyl, carbocyclylalkyl, heterocyclyl, and 
heterocyclylalkyl; 

R«o is selected from the group consisting of substituted or unsubstituted aryl, heteioaryl, 
arylalkyl, heteroarylalkyl, and heterocyclylalkyl; and, 

Reu R62, R 63 . and R« may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, caiboxcyclic add, and 
substituted or unsubsdtuted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, airylalkoxy, heteroarylalkoxy, 
alkylamino, arylalkylamino, arylammo, heteroarylamino, heteroarylaminoalkyl, 
heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and carbocyclyl groups. 






5 



Pyrrole compounds as described throughout this application include compounds of 
formula (XIV) as shown below: 




wherein. 

Res is selected from the group consisting of H, hydroxy, and substituted or unsubstituted 
alkyl, aryl, heteroaryl, heteroarylalkyl, arylalkyl, heteroarylaminoalkyl, arylaminoalkyl, 
heteroaryloxyalkyl, and aryloxyalkyl groups; 

Rce. R« 7 , R<i 8 , and R^q may be the same or different and are independently selected from 

the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic add, and 
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substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
aDcylcarbonylanuno, alkylaminocaibonyl, aminocaibonyl, arylalkoxy, heteroarylalkoxy, 
alkylamino, aiylalkylamino, arylamino, heteroarylamino, heteroaiylaniinoalkyl, 
heterocyclyl, heterocyclylalkbxy, heterocyclylalkyl, and caibocyclyl groups. 

5 

Further preferred pyrrole compounds include those shown in Formula (XV): 




wherein: 

Ki is nitrogen, oxygen, or optionally substituted carbon; 

W is absent or is selected from the group consisting of-O-, -S-, -S(0)-, -SO 2 -, -NH-, -NH- 
.5 CO-, -NR'CO-, -NHSO 2 -. -NR'S 02 -, -CO-, -CO 2 -, -CH 2 -, -CF 2 -. CHF, -CONH-, -CONR’-, 
and —NR -, where R' is alkyl, substituted aUcyl, cycloalkyl, aryl, heteroaryl, heterocyclo; 

Ar is optionally substituted aryl, heteroaryl, or a protectihg group; 

R70 and R 70 ' are independently selected fiom the group consisting of hydrogen and methyl; 

R71 , R72. R73. and R 74 are independently selected fiom the group consisting of hydro^n, 

•0 hydroxyl, and optionally substituted loweraUcyl, cycloloweralkyl, cycUcaminoalkyl, alkyl- 
aminoalkyl, loweralkoxy, amino, alkylamino, alkylcarbonyl, arylcaibonyl, aralkylcarbonyl, 
heteroarylcaibonyl, heteroaralkylcarbonyl, aryl and heteroaryl; 

R75 and R 78 are independently selected fiom the group consisting of hydrogen, halo, and 
optionally substituted lowearalkyl, cycloalkyl, alkoxy, amino, aminoalkoxy, carbonyloxy, 

5 aminocarbonyloxy, alkylcarbonylamino, arylcaibonylamino, aralkylcarbonylamino, hetero- 
arylcaibonylamino, heteroaralkylcarbonylamino, cycloimido, heterocycloimido, amidino, 
cycloamidino, heterocycloamidino, guanidinyl, aryl, heteroaryl, heterocycloalkyl, 
heterocyclocarbonyloxy, heteroarylcarbonyloxy, and arylsulfonamido; 

R 76 is selected fiom the group consisting of hydrogen, aryl, heteroaryl, substituted heteroaryl, 

0 heterocyclyl, and substituted heterocyclyl; 
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R 77 is selected from the group consisting of hydrogen, hydroxy, halo, caiboxyl, nitro, amino, 
amido, amidino, imido, cyano, sulfonyl, methanesulonyl, and substituted or unsubstituted alkyl, 
alkoxy, alkylcaibonyl, arylcarbonyl, aralkylcarbonyl, heteroarylcaibonyl, heteroaralkylcaibonyl, 
alkylcaibonyloxy, arylcarbonyloxy, aralkylcaibonyloxy, heteroarylcaibonyloxy, 

5 heteroaialkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, formyl, 
loweralkylcarbonyl, loweralkoxycarbonyl, aminocaibonyl, aminoaryl, alkylsulfonyl, 
sulfonanudo, aminoaDcoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 
alkylaminocarbonylamino, arylaminocarbonylamino, aialkylcaibonylamino, 
heteroarylcarbonylamino, arylcaibonylamino, heteroarylcarbonylamino cycloamido, 

10 cyclothioamido, cycloamidino, heterocycloamidino, cycloalkyl, cycloimido, heterocycloimido, 
guanidinyl, aryl, heteroaryl, heterocyclo, heterocycloalkyl, arylsulfonyl and arylsulfonamido; 



Anthraquinone compounds of the instant invention include, for example, coxopounds of 
Formula (XVI): 



.5 




Rs4 

Rs3 






5 



wherein. 



XVI 



R 79 . Rgo, R«i, and Rg 2 may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
alkylcarbonylamino, sulfonyl, aminosulfonyl, alkylaminocarbonyl. aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; and, 

R83 and Rs 4 are taken together to form a substituted or unsubstituted 5-6 membered ring 
containing all carbon atoms or 1-2 heteroatoms selected from the group consisting of O, 
S, and N. 
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Quinoxaline compounds referred to flirou^out this q>plication include tricyclic, partially 
unconjugated compounds optionally substituted widi nitrogen heteroatoms as shown in the 
preferred quinoxaline embodiment (XVII) below: 




whraein, 

Ji is either Cor N, 

Ji’ is selected firom the group consisting of H, substituted aryl, unsubstituted aryl, 
substituted heteioaryl, and.unsubstituted heteroaryl; 

10 J2 is either C or N, 

h’ is selected firom the group consisting of H, substituted aryl, unsubstituted aryl, 
substituted heteroaryl, and unsubstituted heteroaryl; 

J3 is selected j&om the group consisting of -CO-, -NH-, and -N=; 
if J 4 is -O- then J 4 ’ is absent; or, 

15 if J 4 is =C- then J 4 ’ is selected from the group consisting of H and substituted or 

unsubstituted aUcyl, alkoxy, aryl, heteroaryl, heteroarylalkyl, arylalkyl, aminoalkyl, 
ahcylanaino, and alkyltfaio groups; and, 

R«5. Rsfo Rs 7 . Rss. Rgg may be the same or different and are independently selected 
from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, caiboxcyclic acid, 
iO and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 

alkylcaibonylamino, sulfonyl, aminosulfonyl, alkylaminocaibonyl, aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylammo, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups. 

>5 

Triazine compounds refer to substituted 6-membered heterocyclic groups with 3 nitrogen 
atoms distributed throughout the ring. The preferred embodiments of the instant invention 
include those shown in structures (XVm), (XEX) and (XX) shown below: 
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wheiein, 

R90 is selected ficom the group consisting of substituted or unsubstituted alkyl, alkenyl, 
akynyl, aryl, heteroaryl, heteroarylalkyl, heteroaiylalkenyl, arylalkyl, and arylalkenyl; 
R91 and R93 are independently selected from the group consisting of H, and unsubstituted 
alkyl; 

R91 is aryl; prefMably phenyl. 



10 



15 



20 




xrx 

wherein, 

R94 is selected from the group consisting of H, amino, alkyl, aminoalkyl, and halogen; 

R9S is selected from the group consisting of substituted or unsubstituted aryl, arylamino, 
arylalkylamino, heteroaryl, heteroarylamino, and het^oalkylamino; ' 

R96 and R97 are independently selected from the group consisting of H, halogen, and 
alky], preferably methyl; or, 

R96 may form a double bond with the nitrogen atom directly below it as indicated by the - 
dashed line in the above structure; and. 



O 




wherein. 
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R 98 is selected from the group consisting of H, substituted alkyl, and unsubstituted alkyl; 
preferably methyl, 

R99 is selected from the group consisting of H, substituted alkyl, and unsubstituted aDcyl; 
preferably ethyl, 

5 Rioo is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 

alkoxyaryl, arylalkyl, and heteroarylalkyL 



10 



15 



10 



Benzazole compounds as described throughout this application include compounds of 
formula (XXI) as shown below; 



.N 



T 



W 







XXI 



wherein, 

A is selected from the group consisting of -0-, -S-, -NH-, and -NRg-; 

W is selected from the group consisting of -CH 2 -, -0-, -S-, -NH-, and -NR«-; 

R? is selected from die group consisting of carbocyclyl, unfused carbocyclylcarbiDcyclyl, 
substituted aryl, unsubstituted aryl, substituted heteroaryl, unsubstituted heteroaryl, 
substituted fused arylheteroaryl, imsubstituted fused arylheteroaryl, substituted nnfnsftH 
arylaryl and unsubstituted unfiised arylaryl; 

R^ is selected from the group consisting of substituted or unsubstituted aryl, and 
heteroaryl; and, 

Rg is independently substituted or unsubstituted alkyl. 



Pyrazalopyrimidine compounds as described throughout this application include 
15 compounds of formula (XXII) as shown below: 




xxn 
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whaein, 

Rioi is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, 
carboxcyclic add, and substituted or unsubstituted alkyl, aryl, heteroaiyl, alkoxy, 
alkylcaibonyl, alkylcaibonylamino, sulfonyl, aminosulfonyl, alkylaminocarbonyl, 

5 aminocaibonyl, aiylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 

heteioarylamino, heteaoaiylaminoalkyl, heterocyclyl, heteiocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl groups; 

Rio 2 is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, 
carboxcyclic add, and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, 

10 alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, 

heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; 

Rio 3 is selected j&om the group consisting of H, nitro, halogen, amino, hydroxy, cyano, ■ 
15 carboxcyclic acid, trifluoromethyl, and substituted or unsubstituted alkyl, aryl, heteroaryl, 

alkoxy, alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; 

Rio 4 is selected from the group consisting of H and substituted or unsubstituted aryl, 
heteroaryl, arylalkoxy, heteroarylalkoxy, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, 
carbocyclylalkyl and carbocyclyl groups; 

Rios is selected from the group consisting of H and substituted or unsubstituted aryl, 
heteroaryl, arylalkoxy, heteroarylalkoxy, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, 
carbocyclylalkyl and carbocyclyl groups; 
wherein at least one of Rio* and Rios is not R 

SMIP compounds identified by in-vitro (cellular or non-cellular assays) or in-vivo 
methods are thorou^y described in Nfcthods 1 and 2 below. 



>5 
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PhanDsceuticdl compositions contsioing the compounds of the invention may be in any 
foim suitable for the intended method of administration, including, for example, a solution, a 
suspension, or an emulsion. liquid carriers are typically used in preparing solutions, 
suspensions, and emulsions. liquid carriers contemplated for use in the practice of the present 
5 invention include, for example, water, saline, pharmaceutically acceptable organic solvent(s), 

pharmaceutically acceptable oils or fats, and the like, as well as mixtures of two or more thereof. 
The liquid carrier may contain other suitable pharmaceutically acceptable additives such as 
solubilizers, emulsifiers, nutrients, buffers, preservatives, suspending agents, thickening agents, 
viscosity regulators, stabilizers, and the like. Suitable organic solvents include, for example, 

10 monobydric alcohols, such as ethanol, and polyhydric alcohols, such as glycols. Suitable oils 
include, for example, soybean oil, coconut oil, olive oil, safflower oil, cottonseed oil, and the 
like. For parenteral administration, the carrier can also be an oily ester such as ethyl oleate, 
isopropyl myristate, and the like. Compositions of the present invention may also be in the form • 
of microparticles, mictocapsules, liposomal encapsulates, and the like, as well as combinations 
15 of any two or more thereof. 

Other additives include immunostimulatory agents known in the art. hnmunostimulatOTy 
oligonucleotides and polynucleotides are described in PCT WO 98/55495 and PCT WO 
98/16247. U.S. Patent Application No. 2002/0164341 describes adjuvants including an 
unmethylated CpG dinucleotide (CpG ODN) and a non-nucleic acid adjuvant U.S. Patent 
10 Application No. 2002/0197269 describes compositions comprising an antigen, an antigenic 

CpG-ODN and a polycationic polymer. Other immunostimulatory additives described in the art ■ 
may be used, for example, as described in U.S. Patent No. 5,026,546; U.S. Patent No. 4,806,352; 
and U.S. Patent No. 5,026,543. 

A controlled release delivery system may be used, such as a diffusion controlled matrix • 
15 system or an erodible system, as described for example in: Lee, "Diffusion-Controlled Matrix 
Systems”, pp. 155-198 and Ron and Danger, “&odible Systems”, pp. 199-224, in “Treatise on 
ControDed Drag Delivery”, A. Kydonieus Ed., Marcel Dekker, Inc., New York 1992. The 
matrix may be, for example, a biodegradable material that can degrade spontaneously in situ and 
in vivo for, example, by hydrolysis or enzymatic cleavage, e.g., by proteases. The delivery 
iO system may be, for example, a naturally occurring or synthetic polymer or copolymer, for 
example in the form of a hydrogel. Exemplary polymers with cleavable 1inVagt>g include 
polyesters, polyorthoesters, polyanhydrides, polysaccharides, poly(phosphoesters), polyamides, 
polyurethanes, poly(imidocarbonates) and poly^hosphazenes). 
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TTie compounds of the invention may be administered entmnlly, orally, paienteially, 
sublingually, by inhalation spray, rectally, or topically in dosage unit formulations containing 
convrational nontoxic pharmaceutically acceptable carriers, adjuvants, and v^cles as desired. 
For example, suitable modes of administration include oral, subcutaneous, transdermal, 

5 transmucosal, iontophoietic, intravenous, intramuscular, intraperitoneal, intranasal, subdermal, 
rectal, and the like. Topical administration may also involve the use of transdermal 
administration such as transdermal patches or ionophoresis devices. The term parenteral 
includes subcutaneous injections, intravenous, intramuscular, intrastemal injection, or infusion 
techniques. 

10 Injectable preparations, for example, st^e injectable aqueous or oleaginous suspensions 

may be formulated according to the known art using suitable disp^ing or wetting agents and 
suspending agents. The sterile injectable preparation may also be a sterile injectable solution or 
suspension in a nontoxic parenterally acceptable diluent or solvent, for example, as a solution in 
13-propanediol. Among the acceptable vehicles and solvents that may be employed are water, 

15 Ringer's solution, and isotonic sodimn chloride solution. In addition, sterile, fixed oils are 

conventionally employed as a solvent or suspending medium. For this purpose any bland fixed 
oil may be employed including synthetic mono- or diglycerides. In addition, fatty acids such as 
oleic acid find use in the preparation of injectables. 

Suppositories for rectal administration of the drug can be prepared by mixing the drug 
20 . with a suitable nonirritating excipient such as cocoa butter and polyethylene glycols that are solid 

at ordinary temperatures but liquid at the rectal temperature and will therefore melt in the rectum 
and release the dmg. 

Solid dosage forms for oral administration may include capsules, tablets, pills, powders, 
and granules. In such solid dosage forms, the active compound may be admixed with at least 
25 one inert diluent such as sucrose lactose or starch. Such dosage forms may also comprise, as is 
normal practice, additional substances other than inert diluents, e.g., lubricating agents such as 
magnesium stearate. In the case of capsules, tablets, and pills, the dosage forms may also 
comprise buffering agents. Tablets and pills can additionally be prepared with enteric coatings. 

Liquid dosage forms for oral administration may include pharmaceutically acceptable 
30 emulsions, solutions, suspensions, syrups, and elixirs containing inert diluents conunonly used in 
the art, such as water. Such compositions may also comprise adjuvants, such as wetting agents, 
emulsifying and suspending agents, cyclodextrins, and sweetening, flavoring, and perfuming 
agents. 
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As tD the mode of administiation, it should be emphasized that it is the combination of 
therapeutic agents ttiat gives rise to its synergistic therapeutic effect no matter whetiier the first 
and the second agent are administered together or separately. Therefore, the two agents may be 
given together in a single dose or in separate ones with respect to space and time. 

5 Effective amounts of the compounds of die invention generally include any amount 

sufficient to detectably treat viral infections. 

Successful treafinent of a subject in accordance with the invention may result in the 
inducement of a reduction or alleviation of symptoms in a subject afflicted with a medical or 
biological disorder to, for example, halt the further progression of the disorder, or the prevention 
10 of the disorder. 

The amount of active ingredient that may be combined with the carrier matoials to 
produce a single dosage form will vary depending upon the host treated and the particular mode 
of administration. It will be understood, however, diat the specific dose level for any particular 
patient will depend upon a variety of factors including the activity of the specific compound 
15 employed, the age, body weight, general health, sex, diet, time of administration, route of 
administration, rate of excretion, drug combination, and the severity of the particular disease 
undergoing therapy. The therapeutically effective amount for a given situation can be readily 
determined by routine experimentation and is within the skill and judgment of the ordinary 
clinician. 

20 The compounds of the present invention can also be administered in the form of 

liposomes. As is known in the art^ liposomes are ^neraUy derived from phospholipids or other 
lipid substances. Liposomes are formed by mono- or-multilamellar hydrated liquid crystals that 
are dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 
metabolizable lipid capable of forming liposomes can be used. The present compositions in 
25 liposome form can contain, in addition to a compound of the present invention, stabilizers, 
preservatives, excipients, and the like. The preferred lipids are the phospholipids and 
phosphatidyl cholines (lecithins), both natural and synthetic. Methods to form liposomes are 
known in the art See, for example, Prescott, Ed., Methods in Cell Biology, Volume XIV, 
Academic Press, New York, N.W., p. 33 et seq (1976). 

30 While die SMIP compounds of the invention can be administered as the sole active 

pharmaceutical agent, they can also be used in combination with one or more other agents used 
in the treatment of SARSs. Other representative agents useful in combination with the 
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co^^)ounds of the invention for die treatment of viral infections include, for example, interferon, 
ribavirin, gancyclovir and the like. 

When additional active agrats are used in combination with the compounds of the present 
inv^tion, the additional active agents may g^erally be ranployed in therapeutic amounts as 
5 indicated in the Physicians' Desk Reference (PDR) 53”* Edition (1999), that is incorporated 
herein by reference, or such therapeutically useful amounts as would be known to one of 
ordinary skill in' the art 

The compounds of the invention and the other therapeutically active agents can be 
administered at the recommended maximum clinical dosa^ or at lower doses. Dosage levels of 
10 the active compounds in the compositions of the invention may be varied so as to obtain a 

desired therapeutic response dq>end[ing on the route of administration, severity of the disease and 
the response of the patient The combination can be administered as separate compositions or as 
a single dosage form containing both agents. When administered as a comb inati on, the 
therapeutic a^nts can be formulated as separate compositions that are given at the same time or 
15 different times, or the therapeutic agents can be given as a single composition. 

Compounds of the present invention can be readily synthesized using the methods 
described herein, or other methods, that are well known in the art . 

. The compounds can be used in the form of salts derived from inorganic or organic acids. 
These salts include but are not limited to the following: acetate, adipate, alginate, citrate, 

20 aspartate, benzoate, benzenesulfonate, bisulfate, butyrate, camphorate, camphorsulfonate, 
digluconate, cyclopentanepropionate, dodecylsulfate, ethanesulfonate, glucoheptanoate; 
glycerophosphate, hemisulfate, heptanoate, hexanoate, fiimarate, hydrochloride, hydrobromide, 
hydroiodide, 2-hydroxyethanesulfonate, lactate, maleate, methanesulfonate, nicotinate, 
2-napthalenesulfonate, oxalate, pamoate, pectinate, persulfate, 3-phenylproionate, picrate, 

25 pivalate, propionate, succinate, tartrate, thiocyanate, p-toluenesulfonate and undecanoate. Also, 
the basic nitro^n-containing groups can be quatemized with such agents as loweralkyl halides, 
such as methyl, ethyl, propyl, and butyl chloride, bromides, and iodides; dialkyl sulfates like 
dimethyl, diethyl, dibutyl, and diamyl sulfates, long chain halides such as decyl, lauryl, myristyl 
and stearyl chlorides, bromides and iodides, aralkyl halides like benzyl and phenethyl bromides, 
30 and others. Water or oil-soluble or dispersible products are thereby obtained. 

Examples of adds that may be employed to form pharmaceutically acceptable acid 
addition salts include such inorganic acids as hydrochloric add, sulphuric acid and phosphoric 
add and such organic adds as oxalic add, maleic acid, sucdnic acid and dtric acid. Basic 
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addition salts can be prepared in situ during the final isolation and purification of the compounds 
of formula CO, or separately by reacting carboxylic acid moieties with a suitable base such as the 
hydroxide, carbonate or bicarbonate of a pharmaceutical acceptable metal cation or with 
ammonia, or an organic primary, secondary or tertiary amine. Pharmaceutical acceptable salts 
5 include, but are not limited to, cations based on the alkali and alkaline earth metals, such as 
sodium, lithium, potassium, calcium, magnesium, al uminum salts and the like, as well as 
nontoxic ammonium, quaternary ammoniiun, and amine cations, including, but not limited to 
ammonium, tetramethylammonium, tetraethylammoniuin, methylamine, dimethylamine, 
trimethylamine, triethylamine, ethylamine, and the like. Other representative organic amines 
10 useful for the formation of base addition salts include diethylamine, ethylenediamine, 
ethanolamine, diethanolamine, piperazine and the like. 

Various compounds and methods of thdr synthesis are disclosed in international patent 
application Publication Nos. WO02/18327 (benzamide and pyridylamide based compounds); 
W00222598, and WO02/18383 (ABIQ based compounds); and WO 02/81443 (pthalamide base 
1 5 compounds), that have been found within context of this invention to be useful for immune 
potentiation. The entire disclosure of these U.S. and international publications is incorporated 
herein by this reference. Other compounds or intermediates of interest in the present invention 
were purchased from commercially available sources using the following method: the chemical 
stracture of interest was drawn into the ACD-SC database (from MDL Information Systems). A 
JO search of the following companies/institutions, among others, retrieved the identified 

compound’s supplier and purchasing information: ASDI, ASINEX, BIONET, CHEMBRIDGE, 
OEIEMDIV, CHEMEX, CHEMSTAR, COMGENEX, CSC, INTERBIOSCREEN, LABOTEST, 
MAYBRIDGE, MICROSOURCE/GENESIS, OLIVIA, ORION, PEAKDALE, RYAN 
SCIENTIFIC, SPECS, TIMTEC, U OF FLORIDA, and ZELINSKY. 

'5 , 

BENZAZOLE COMPOUNDS 
Scheme 1 

Compounds of the invention containing a benzimidazole core may be prepared usin g a 
number of methods fanuliar to one of skill in the art. hi one method, suitably functionalized 
•0 diamines may be coupled with various thioisocyanates to form the intermediate thioureas. 

Cyclization to form the benzimidazole moiety may be effected under known conditions such as 
with treatment carbodiimides or alkyl halides. Alternatively the diamines may be reacted 
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sequratially with ca^nyl diimidazole and phosphoryl chloride followed by coupling with the 
appropriate amine. 




Compounds containing the oxazole structure may similarl y be prepared according to the 
5 methods above or according to other known general procedures. Haviv et. al. (J. Med. Chem. 

1988, 31, 1719) describes a procedure for assembling oxazole cores wherein a hydroxy aniline is 
treated with ethyl potassium xanthate. The resulting sulfuryl benzoxazole may then be 
chlorinated and coupled with an amine. 




10 Compounds containing a benzothiazole core may also be prepared according to known 

methods. An ortho-halothioisocyanate may be reacted with an amine to form a thiourea. 
Reduction with NaH then allows formation of the thiazole ring. 




Benzotbiazoles may generally be substituted in accordance with the present invention, 
15 such as through the following synthetic pathway: 

Br-</T I ^ 

NMP, 200“C, 6min Ar 1M'»C,6min 

microwave microwave 
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Synthesis of 4-[(2-{[4-chloro-3-(trifluoromethyl)phenyl]aiiiino}- 

( .» 

lH-ben7im irfa7.nl-6- vnoxv1-N-nietilvlDVridine-2-caTt>oxamide 
The compound 4-[(2-{ [4-chloio-3-(trifluoromethyl)phenyl]amino}-lH-benzimidazol-6- 

5 yl)oxy]-N-methylpyridine-2-caiboxaimde (159322) was synthesized as follows: 




Step 1. Synthesis of 4-[(4-amino-3-nitrophenyl)oxy]-iV-methylpyridine-2-carboxanaide: 
A mixture containing 4-amino-3-nitrophenol (leq) and potassium bis(trimethylsilyl)amide (2eq) 
was stirred in dimethylformamide for 2 hours at room temperature. To this mixture was added 
10 (4-chloro(2-pyridyl))-^-methylcarboxamide (leq) and potassium carbonate (1.2eq) and stirred at 
90°C for 3 days. Thie reaction mixture was then concoitrated and partitioned between ethyl 
acetate and water. The organic layer was separated and washed with brine, dried, filtered, and 
concentrated in vacuum to give brown solid. Purification on silica gel (2% triethyl aminp. / 50% 
ethyl acetate in hexane) gave 4-[(4-amino-3-nitrophenyl)oxy]-JV-methylpyridine-2-carbbxamide 
15 as an orange solid. The product gave satisfactory NMR. HPIX, 3.39min; MS: = 289. 

Step 2. Synthesis of 4-[(3,4-diaminophenyl)oxy]-JV-methylpyridine-2-carboxamide: The 
mixture containing [4-(3-amino-4-nitrophenoxy)(2-pyridyl)]-iV- in methanol with catalytic 
amoimt of 10%Pd/C was hydrogenated until disappearance of the yellow color to yield the 
product amine. HPLC, 2.5mins; MS: Mlf = 259. 



>0 

Step 3. Synthesis of 4-[(2-{ [4-chloro-3-(trifluoromethyl)phenyl]amino}-lH- 
benz i midazol-6-yl)oxy]-N-methylpyridine-2-carboxamide: 

The mixture containing 4-[(3,4-diaminophenyl)oxy]-iV-methylpyridine-2-carboxamide (leq) and 
4-chloro-3-(trifluoromethyl)benzeneisothiocyanate (leq) in tetrahydrofuran was stirred at room 
t5 temperature for 16 hours to give the corresponding thiourea. To the resulting mixture was a dd <* d 
l-ethyl-3-(3Hiimethylammopropyl)carbodiiinide hydrochloride (2eq) and the mixture was stirred 
for another 10 hours. The mixture was concentrated and partitioned between ethyl nrptatP and 
water. The organic layer was washed with brine and dried. Purification on HPLC gave 4-[(2- 
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{[4-cMoro-3-(trifluoromethyl)phenyl]aix)ino}-lH-benziinidazoI-6-yl)oxy]-N-methylpyridiiie-2- 
caiboxamide. MS: MH*' = 462 



10 



15 



20 



25 



Synthesis of 4-({2-[(4-bromophenyl)aniino]-l-methyl- 

1 TT-hCTTiiniH azol-5-vl loxvVN-methvlDVridine-2-cafhoxamidft 
The compound 4-{{2-[(4-biDmophenyl)amino]-l-methyl-lH-benzimidazol-5-yl}oxy)-N- 

methylpyridine-2-caiboxamide (161651) was synthesized as follows: 



O 




Stq) 1. Synthesis of 4-{[3-amino-4-(methylamino)phenyl]oxy}-/V-methylpyridine-2- 
caiboxamide: A solution of 4-[(4-amino-3-nitrophenyl)oxy]-iV-methylpyridine-2-caiboxarnide 
(leq) in methylene chloride was treated with trifluoroacetic anhydride (leq) and stirred for 10 
minutes at 0 "C. The mixture was quenched with said. NaHCOa solution. The organic layer was 
sep^ted and washed with water, brine, dried and evaporated. MS: MBT=385.2 

To a solution of the trifluroacetamide (leq) in a rmxture of toluene, acetonitrile and 
sodium hydroude solution (50%) was added l»nzyltrimethylammonium chloride (leq) and 
dimethyl sulfate (1.2eq). The biphasic mixture was stirred overnight at room temperature and 
evaporated. The mixture was taken up in ethyl acetate, washed with water, brine, dried and 
evaporated. The crude product was purified by colunm chromatography eluting with 1:1 
hexanes and ethylacetate followed by 2% triethylamine in 1:1 hexanes and ethyl acetate followed 
by 2% triethylamine in 1:1 hexanes and ethyl acetate to afford iV-methyl-4-{ [4-(methylamino)-3- 
nitrophenyl]oxy}pyridine-2-carboxamide as a reddish orange solid. MS: MH*" = 303.1. 

The solution of nitromethylaniline in methanol was treated with 5% paUadium on carbon 
and stirred under hydrogen atmosphere for 15 min. (until the disappearance of yeUow coloration) 
at room temperature. Tlie mixture was filtered and the filtrate was concentrated to provide 0.36 
g of the diamine 4-{[3-amino-4-(methylamino)phenyl]oxy}-iV-methylpyridine-2-carboxamide. 
MS: MBT= 273.3. 



Step 2. Synthesis of 4-({2-[(4-bromophenyl)amino]-l-methyl-lH-benzimidazol-5- 
yl}oxy)-N-methylpyridine-2-carboxamide: A solution of the diamine 4-{ [3-amino-4- 
(methylamino)phenyl]oxy}-iV-methylpyridine-2-carboxainide (leq) in methanol was treated with 
4-bromophenylisothiocyanate (leq) and stirred at 60 °C - 65°C for 2 hours. Tbe reaction 
mixture was cooled down to room temperature and methyl iodide (leq) was added and stirred 

overnight at 60®C. The reaction was cooled to room temperature, evaporated, taken up in ethyl 
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acetate, and washed with water and brine, dried, and evqwrated under reduced pressure. 

Column chromatogrq)hy using a gradient solvent system of hexanes and ethyl ar/»tatP and either 
1:1 methylene chloride and acetone or 5% methanol in methylene chloride yielded the product as 
a half white powder. MS: MH*^=452.3 



AMINOBENZIMIDAZOLYLQUINOLINONES 

Con^raunds of structure I may be synthesiaed j&om simple starting molecules as shown in 
Schemes 1-4 and exemplified in the Examples: As shown in Scheme 1, compounds of structure 
I may generally be prepared using aromatic compounds substituted with atnin^^g ^d carboxylic 
.0 acid groups. 

Scheme 2. 

o o 




COzH 
NH 





As shown in Scheme 2, a substituted aromatic compound such as a substituted or 
unsubstituted 2-aminobenzoic add may be reacted with an acyl halide such as methyl 2- 
(chlorocarbonyl)acetate to produce an amide that will react with a substituted or unsubstituted 
5 1 ,2-diammobenzene. The resulting product is a 4-hydroxy-substituted compound of structure I. 

One skilled in the art will recognize that the procedure set forth in Scheme 1 may be modified to 
produce various compounds. 

A method for preparing 4-amino substituted compounds of structure I is shown in 
Scheme 3. As shown in Scheme 3, aromatic compounds substituted with aminp. and nitrile 
) groups may be used to synthesize 4-amino substituted compounds of structure I. A compound 
such as ethyl 2-cyanoacetate may be reacted with ethanol to produce ethyl 3-ethoxy-3- 
iminopropanoate hydrochloride. Subsequent reaction with a substituted or imgiihctitiitPH 
phenylenediamine provides substituted or unsubstituted ethyl 2-benzimidazol-2-ylacetate. 
Reaction of a substituted or unsubstituted ethyl 2-benzimidazol-2-ylacetate witfi an aromatic 
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compound having an amine and nitnle group such as substituted or unsubstituted 2 - 
aminobenzonitrile with a base such as bthium bis(trimethylsilyl)amide or a Lewis acid such as 
tin tetrachloride provides the substituted or unsubstituted 4-amino substituted compound of 
structure I. 



Scheme 3. 

EtOH 
HCI 






EtO‘ 



O NH • HCI 



3Et 




heat 





Scheme ,4 illustrates a ^nranl synthetic route that allows for the synthesis of 4- 
dialkylamino and 4-aUcylammo compounds of structure L An inspection of Scheme 3 shows that 
4-hydroxy substituted compounds of structure I may be converted into the 4-chloro derivative by 
reaction with phosphorous oxychloride or thionyl chloride. The 4-chloro derivative may then be 
reacted with an aUcylamine or dialkylamine to produce the corresponding 4-alkylamino or 4- 
dialkylamino derivative. Deprotection affords the final 4-alkylamino or 4-dialkylamino . 
compounds of structure I. Other groups that may be reacted with the 4-chloro derivative in this 
manner include, but are not limited to, ROH, RSH, and CuCN. 
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Scheme 4. 






EtOH 

HQ 





O 





As shown in Scheme 5, the synthesis of compounds of structure I having a H, alkyl 
group, aryl group, or heterocyclyl group in the 4-position may be accomplished using a 
substituted or unsubstituted 2-b en z imi da / ol-2-ylacetate prepared as shown in Schemes 3 and 4. 




R" = H, alkyl 

5 aryl, heterocyclyl 



TfflOSEMCARBAZONES 

General procedure for the preparation of thiosemicarbazones 
Scheme 6 



O 



H 



H 



AcOH 



S 



N 

H 



A solution of aldehyde (1.0 equiv.) and thiosemicaibazide (1,05 equiv.) in acetic acid was 
stirred overnight Excess of acetic acid was removed to give a residue, that was washed with 
ethanol, or purified by preparative-HPLC to give the thiosemicarbazone. 
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Scheme 7 

A solution of aldehyde (1.0 equiv.), tiiiosemicarbazide (1.05 equiv.) and acetic acid (0.1 
equiv.) in methanol was stiired ovemi^t Methanol was removed to give a residue, that was 
worked up as in Scheme 6. 

5 Scheme 8 

To a solution of {[(lB)-l-aza-2-(4-fluoro-3-nitrophenyl)vmyl]amino}-aminomethane-l- 
thione in ethanol was added an arylamine (2. 1 equiv.). The solution was stiired at room 
temperature imtil the starting fluoride dis^peared. The solution was purified to the product 




10 

Scheme 9 ’ 

A mixture of 4-(diethylamino)-2-hydroxybenzaldehyde (1 equiv.), benzylic bromide (1.2 
equiv.) and powder potassium carbonate in ethanol was stirred at room temperature for 2 da 3 rs. 
Ethanol was removed, and the residue was dissolved in ethyl acetate and water. The organic 
15 layer was washed with aqueous NaHCOs and brine, dried over Na 2 S 04 ., and concentrated. The 
residue was purified on silica gel eluting with ethyl acetate/hexane to give 4-(diethylamino)-2- 
benzoxylic-benzaldehyde. 

The aldehydes were converted to tbiosemicarbazones according to Scheme 7. 




Scheme 10 
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A solution of 3,4-difluoioben2enecaibonitrile (1 eqmv.), amine (1.5 equiv.) andDIEA (2 
equiv.) in jNMP was heated in a Smith Microwave (Phonal Chemistry) for 30 minutes. The 
reaction mixture was purified on silica gel to give 4-substituted 3-fluorobenzenecarbbnitrile. 

To a solution of nitrile in toluene at —78 °C was added DIBAL-H (1 M in toluene, 1.5 
5 equiv.). The reaction mixture was warmed to rt, and stirred for 16 h, and quenched with 
methanol/ediyl acetate/brine (1:1:4). After being stirred at rt for 30 min, the solution was 
extracted with ethyl acetate (3x). The combined organic layers were washed with aqueous 
NaHCOs, brine and concentrated. The alddhyde was purified on silica gel or directly conv«ted 
to thiosemicarbazones (Scheme 7). 

10 Scheme 11 

A solution of 2,4,5-trifluorobenzenecarbonittile (1 equiv.) and 4-arylpiperazine (1.2 
equiv.) and DIEA (1.2 equiv.) in TBF was heated at 80 **C for 2 hours. The mixture was.purified 
on silica gel to give 4-substituted 2,5-difluorobenzenecarbomtiile. 

Scheme 12 

15 To an alcohol (1.0 equiv) was added potassium r-butoxide in THF (1 M, 1.1 equiv). 

After 5 minutes, the solution was added to a solution of 4-N-substituted-2,5- 
difluorobenzenecatbonitrile (1 equiv.) in THF. The reaction mixture was stirred at rt overnight 
and quenched with aqueous armnonium chloride. The aqueous layer was extracted with, ethyl 
acetate (3x). The combined organic layers were washed with brine, and concentrated to give a 

20 residue, that was purified to give 4-N-substitoted-2-0-substituted-5-fluorobenzenecarbonitrile. 

. 4-N-substituted-2-0-substituted-5-fluorobenzenecarbonitrile was reduced with DIBAL-H 
to give a 4-N-substituted-2-0-substituted-5-fluorobenzaldehyde according to procedure in 
Scheme 10. 

The aldehyde was converted to tire corresponding thiosemicarbazone using Scheme 7. 

25 Scheme 13 

A solution of 4-N-substituted-2,5-difluorobenzMiecarbonitrile (1 equiv.), amine (1.5 
equiv.) and DIEA (2 equiv.) in NMP was heated in a Smith Microwave (Personal Chemistry) for 
30 minutes. The reaction mixture was purified on silica gel to give 4-N-substituted-2-N- 
substituted-5-fluorobenzenecaibonitrile. 

30 4-N-substituted-2-N-substituted-5-fluorobenzenecarbonitiile was reduced with DIBAL-H 

according to procedure described in Scheme 10 to give 4-N-substituted-2-N-substituted-5- 
fluorobenzaldehyde. 

Preparation of amino{3-[5-(3-chlorophenyl)(2-fi^l)](2-pyrazolinyl)}methane-l-thione 
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15 



20 



25 



To a solution of 5-(3-chlorophenyl)jEuran-2-carbaldehyde (1.0 equiv.) in THF at 0 ®C was 
added MeMgBr in ether (3.0 equiv.) and stirred for 45 min . The reaction was quenched with 
water, diluted with ether and filtered through Celite. The organic layer was separated and 
washed with brine, dried ovct MgS 04 , and concentrated to give the l-[5-(3-chlorophenyl)-2- 
furyl]ethan-l-ol. 

To a solution of secondary alcohol(1.0 equiv.) in CH 2 CI 2 was added Mn02 (10 equiv.). 
The reaction was stirred ovemigfit, filtered ttirough Celite, and concentrated to give l-[5-(3- 
chlorophenyl)-2-furyl]ethan-l-one. 

To a mixture of ketone (1.0 equiv.), paraformaldehyde (2.0 equiv.), and dimethylamine 
hydrochloride (2.0equiv) and molecular sieves in ethanol was added concentrated hydrochloric 1 
acid (cat). The reaction was refluxed overnight under nitrogen and the concentrated. A few 
drops of HCl was added, and the mixture was worked up with DCM and water. The organic 
layer was discarded. The aqueous layer was adjusted to basic and extracted with DCM (3x). 

The organic layer was washed with brine, dried over MgS 04 , and concentrated to yield 3- 
(dimethylamino]hl-[5-(3-chlorophenyl)(2-furyl)]propan-l-one. 

Thiosemicarbazide (1.0 equiv.) was dissolved in MeOH upon heating under nitrogen. 
Aqueous sodium hydroxide (6 M, 9.0 equiv.) was added to the reaction. A methanol solution of 
3-(dimethylamino)-l-[5-(3-chlorophenyl)(2-furyl)]propan-l-one (1.0 equiv) was then added 
dropwise to the reaction mixture. TTie solvent was removed and the residue was dissolved in 
DCM and washed with water, brine, dried oVer MgS 04 , and concentrated. The final compound 
was purified by preparative-HPLC to give amino{3-[5-(3-chlotophenyl)(2-furyl)](2- 
pyrazolinyl)}methane-l-thione; LC^MS m/z 306.2 (MH+); Rt =3.06 minutes . 

Scheme 14 




- 267 - 



wo 2004/092360 



PCT/US2004/011710 



To a solution of 4-pyridylmethylaiiime (1.0 eqdv.) and triethylamine (2.0 equiv.) in 
CHCI3 was added CS2 (1.0 equiv.)) and stiired overnight The reaction was cooled to 0 ®C and 
ethyl chloroformate (1.0 equiv.) was added dropwise. Tlie reaction was stirred for 15 min at 0 °C 
and then stirred at room temperature for 2 hrs followed by addition of (tert- 
5 butyl)oxycarbohydrazide (1.2 equiv.). After stirring for an addition hour the mixture was 
washed with aqueous citric acid (5%), saturated NaHCOa, brine, dried over MgS04, and 
concentrated. The desired Boc protected thiosemicarbazide was purified using column 
chromatography. 

To a solution of Boc protected thiosemicarbazide (1.0 equiv.) dissolved in DCM was 
10 added HCl in dioxane (2M, 8.3 equiv.) and stirred for 15 min. MeOH is then added to dissolve 
the precipitate, followed by addition of the furfural, and small amount of acetic acid (0.5 mL). 
The mixture is stirred overnight and the solvents are removed to give a residue purified by 
preparative-HPIiH to give the thiosemicaibazone. 

Synthesis of 4-[4-(4-methylpiperazm-l-yl)phenoxymethyl]beiizaldehyde 




To a solution of 4-piperazin-l-yl phenol (1 equivalent) in CHCI3, cooled to 0 °C, was 
added di-r-butyl dicarbonate (1 equivalent) in CHCI3 drop-wise. The solution was stirred at 0 ®C 
for 1 hour before removing from the cold bath and stirring at ambient temporatures for 18 hours. 
20 The organic solution was washed aqueous NaHC03 and brine dried over MgS04 and 
concentrated the crude material was used without pmification. 

A solution of the resulting 4-(l-BOC-piperazin-4-yl)phenol (1 equivalent) in dry CH3CN 
was slowly added drop-wise to a slurry of NaH (1 equivalent) in dry CH3CN at room 
temperature under N2. The slurry was stirred at room temperature for 2 hours before the solids 
25 were filtered and washed with Et20. 




Sodium 4-(l-BOC-piperazin-4-yl)phenoxide (1 equivalent) and methyl 4- 
bromomethylbenzoate (1 equivalent) were combined in dry acetone and heated to reflux at 60 ®C 
for 18 hours. The slurry was filtered and the filtrate was then concentrated to provide the crude 
JO methyl 4-[4-(l-BOC-piperazin-4-yl)phenoxymethyl]benzoate, that was used without 
purification. 
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To a slurry of liAULt (4 equivalents) in dry THF, cooled to 0 ®C under N 2 , was slowly 
added drop-wise a solution of mediyl 4-[4-(l-BOC-piperazin-4-yl)phenoxymethyl]ben2oate (1 
5 equivalent) in dry THF. Once the addition was complete, the slurry was heated to reflux at 80 
°C for 1 hour. Hie slurry was subsequently cooled to 0 °C and treated with water, 10% aq. 
NaOH and with water again. The resulting solids were filtered, and the filtrate was diluted with 
chloroform, washed with brine, dried over MgS 04 and concentrated, providing the crude 4-[4-(4- 
methylpiperazin-l-yl)phenoxymethyl]benzyl alcohol that was used without purification*. 




To a solution of DMSO (2.6 equivalents) in dry DCM , cooled to -78 °C under N 2 was 
added oxalyl chloride (1.1 equivalents) in DCM drop-wise. The solution was stirred at -78 °C 
for 5 minutes before a solution of 4-[4-(4-methylpiperazin-l-yl)phenoxymethyl]benzyl alcohol 
15 (1 equivalent) in DCM was added drop-wise, and allowed to stir at -78 °C for another 30 

minutes. Triethylamine.(2.5 equivalents) was slowly dripped in before allowing the solution to 
reach ^bient temperatures. The solution was washed with aqueous NaHCOa and brine, tMed 
over MgS 04 and concentrated to provide the crude 4-[4-(4-methylpipCTazin-l- 
yl)phenoxymethyl]benzaldehyde that was converted to thiosemicarbazones according to Scheme 
20 7. 
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PYRROLES 

Scheme 15 
Synthesis of Pyrrole 







HBrru, Hun!g*& Base. 
CHaCfe.Fn’. 50-70% yield 





DMAP. Die, CH2CI2. 
RT. 80 % yield 





10 Preparation of fe/f-butyl (2E)-3-(2,4-dichlorophenyl)prop-2-enoate (2). 

Neat Die (1.4 eq) was added to a well stined solution of cinnamate (1 eq), t-butyl 
alcohol (4 eq), DMAP (1.4 eq) and under argon at rL (Note - Tlie c innamate must be 

completely in solution that may require gentle warming. Allow the solution to cool to room 
temperature before adding the DIG. To avoid an exotherm on larger scales, it may be prudent to 
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dilute the DIG with CH 2 CI 2 before flie addition and have an ice bath ready.) After stirring for 8 
hours, the reaction develops a white precipitate. The reaction may be monitored by TLC eluting 
with 25% EtOAc/Hexane (Rf of product was 0.9). The entire reaction was loaded into a 
separatory funnel (washing with CH 2 Q 2 ). The organic mixture was washed with citrate, saL aq. 

5 NaHCOs, water, and brine. Hie organic layer was dried (Na 2 S 04 ), filtered, and concentrated to 
dryness to give the crude product as an oil. The crude oil was mixed with hexane and stirred for 
30 min. The precipitate that forms was filtered over celite and the filtrate was ev^orated. The 
hexane mixture was loaded onto a filter plug of silica and eluted with EtOAc/hexane (97:2 v/v). 
The first eluted UV active fractions are collected and evaporated to give >99% pure 2 (75-80% 
10 yields). 




’ Preparation of tert-bufyl 4-(2,4-dichlorophaiyl)pyrrole-3-carboxylate (3). 

15 Dry ether was added to NaH (1.5 eq as the oil dispersion) under argon. After decanting 

off the ether via syringe, the NaH was suspended again with fresh ether under argon. A solution 
of TOSMIC (1.1 eq) and 2 (1 eq) dissolved in a mixture of ether and DMSO was added dropwise 
to the stirred suspension of NaH at 0 °C over 20-30 min. The addition was mildly exothermic 
and evolved gas. After the addition, the reaction was allowed to warm to ambient rt. The 
10 progress of the reaction was followed by TLC (25% EtOAc/Hexane, the UV active product was 
at Rf = 0.4) and LCMS until done (~2-3 h). Upon completion, the reaction was carefully 
quenched with sat. aq. NH 4 CI (added slowly to avoid strong gas evolution and exotherm) and 
diluted with ether. The layers were separated and the organic phase was washed with saL aq. 
NaHCOa, water, and brine. The crude dark solid can be purified by recrystallization. Best 
15 results were achieved either through recrystallization directly from a mixture of hot 

EtOAc/hexane (1:3 v/v) or by dissolving the crude product in minimal hot EtOAc followed by 
addition of hexane (-2 volumes of hexane based on the volume of EtOAc). The hot solutions 
were allowed to cool to room temperature and age over night The crystals were first filtered and 
then washed with hexane giving 99% pure product in 60-70 % yield. 

JO 
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Preparation of fe/t-butyl 4-(2,4>dichlorophenyl)-l-[3-(l,3-dioxobenzo[c]azolm-2- 
yl)propyl]pyrrole-3-carboxylate (4). 

5 Solid NaH (1.5 eq as the oil dispersion) was added in small portions to a solution of 

pyrrole 3 (1 eq) and 3-bromopropyl phthalimide (1.2 eq) dissolved in DMF stirred at room 
temperature and flushed with argon. NOTE - Some gas evolves, but the temperature does not 
seem to rise above 40-50 °C. The reaction was stirred for 1.5 h at room temperature under argon. 
The reaction was followed by TLC (CH2Cl2/acetonitrile (95:5 v/v), the UV active product was at 
10 Rf = 0.5) and LXIMS. Upon completion, the reaction was quenched with saL aq. NH4CI (add 

slowly to avoid strong gas evolution and exotherm). Sat. aq. NaHCOs was then added to avoid 
an emulsion, and the basic organic mixture was extracted with ether. The combined ether layers 
were washed with sat. aq. NaHCOa, water, brine, dried Na2S04, filtered, and concentrated to 
dryness to give the crude product The crude product was purified by eluting through silica with 
15 EtOAc/Hexane (1:4 v/v). The purified product contained some residual 3-bromopropyl 

phthalimide, that did not int^ere with subsequent synthetic steps. The material was taken on 
and used without further purification. Assume a quantitative yield.. 




iO Preparation of terf-butyl l>(3-amuiopropyl)-4-(2,4-didhlorophenyI)p3nrrole>3*carboxylate 

( 5 ). 

The Pthalimido Pyrrole 4 (1 eq) was dissolved in ethanol and hydrazine (3 eq) at room 
tempCTature under nitrogen. Upon heating to reflux, the reaction generated a white precipitate. 
Stir at reflux until complete (~2 h) by TLC (CH2Cl2/acetonitrile (95:5 v/v), the UV active 
15 product was at Rf = 0.2) and LCMS. Upon reaching completion, the reaction was allowed to 

cool to room temperature and die precipitate was vacuum-filtered off using a medium to fine 

cintered-glass filter. The filtrate was concentrated under reduced pressure to a gummy solid. 
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The cnide matenal was taken up in ethanol/EtOAc (1:1 v/v), stined and the precipitate was 
filtered off in the same fashion as before. Hie filtrate was concentrated under reduced pressure 
and than (Med in vacuo for 10-15 min. This process of adding ethanol/EtOAc, filtering and 
concentrating was done one more time or as needed to remove the majority of the white 
5 piecnpitate and residual hydrazine. TTie product was thrai (Med in vacuo overnight The material 
was used without further purification. Once (Med, the reaction yielded the product as a glass 
(-87% yield over 2 steps). 

oXC" 

Huni^s Base, DMA, 

80 C, 58-60% yield 

10 

Prqiaration of teif-butyl l-{3-[(6-ainmo-5-nitro(2-pyridyl))ajiiiiio]propyl}-4-(2,4- 
didilorophenyl)pyrrole-3-carboxylate (7). 

To the premixed dry reagents, pyrrole 5 (1 eq) and powdered 6-chloro-3-nitro-2- 
pyridylamine (6) (1.1 eq), was added the DMA followed by HOnig’s base (2 eq) se(iuentially with 
15 stirring at rt. The reaction was then heated to 80 °C overnight The reaction was followed by 

TLC (EtOAc/hexane (1:1 v/v), the UV active yellow product was at Rf= 0.25), HPLC and 
LCMS. Upon completion as judged by HPLC, the reaction was allowed to cool to 70 °C. 
Ethylene diamine (anhydrous) was then added to the reaction to destroy any remaining 
chloropyridine 6. After 15 min stirring at 70 °C, the reaction was cooled and quenched with the 
20 addition of sat. aq. NaHCOa. The aqueous mixture was extracted with EtOAc, and the combined 
organic layers were washed with sat aq. NaHCOs, water, brine, dried, filtered, and concentrated 
to dryness to give the crude product as a brown-yellow solid. The crude product was purified by 
flash chromatography eluted with EtOAc/hexane (4:6 v/v). The purified SnAr adduct 7 was 
isolated in 58% yield as a yeUow solid. 

25 

TFA (10%v/i>) 

CHaCSj, 

1% water, 

RT. -.99% yield 
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Preparation of l-{3-[(6-ammo-5-iiitro(2-pyiidyl))aiiiino]propyl}-4-(2,4- 
(ficUoiophenyl)pyriDle-3-caiboxylic add (8). 

In a vial, TFA (catalytic amount) was added to a stirred mixture of /ert-butyl ester pyrrole 

7 (1 eq), water (.1%), and CH 2 CI 2 at rt. The vial stirred at room temperature imtil done (~12 h. 

5 The reaction was then concentrated under reduced pressure at room temperature and dried in 

vacuo. The crude residue was dissolved again in 01202 and concentrated under reduced 

pressure at rL Hie material was used in the final coupling step without further purification as the 

TFA salt 

yJ>H 

HgN 

HBTU, Hunig's Base. 

CH 2 Ct 2 ,RT, 50-70% yfeld 

10 

Preparation of iV>((lS>'2-hydroxy*isopropyl)(l-{3-[(6-aiiiino-5-nitro(2- 
pyiidyl))amino]propyl}-4-(2,4-diclilorophenyl)pyrrol-3-yl)carboxainide (9,). 

(2S)-(-t-)-2-Aminopropan-l-ol (1.5 eq) was added to a stirred mixture of acid (8) (1 eq), 
HBTU (1.5 eq), Htlnig's base (2 eq) and DMF (premixed sequentially in this order in a vial) at 
15 room terrperature under argon. The reaction was stirred for 3-4 h until cornplete as shown by 
LCMS and HPLC. Hie reaction mixture was subsequently diluted with EtOAc, washed with 
NaHCOs, and concentrated to afford a powder in a 70% yield. 

Nomenclature for the Example compounds was provided using ACD Name version 5.07 
software (November 14, 2001) available from Advanced Chemistry Development Inc. Some of 
20 the compounds and starting materials were named using standard lUPAC nomenclature. ' 

The compounds of Table 34 were synthesized following the synthetic methodology 
described above in the Examples and Schemes, and screened following methods 1 and 2 below. 
The precursors are readily recognizable by one skilled in the art and are commercially available 
from Aldrich (MUwaukee, WI) or Acros Organics CFittsburgh, PA), among others. 

25 

Screening mefiiods for SMIP/SMIS compounds 

Method 1 

Candidate small molecule immuno-potentiators can be identified in vitro. Compounds 
are screened in vitro for their ability to activate immune cells. One marker of such activation is 
30 the induction of cytokine production, for example TNF-a production. Apoptosis inducing .smaH 
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molecules may be identified having diis activity. These small molecule immuno-potentiatois 
have potential utility as adjuvants and immuno-therapeutics. 

hi an assay procedure (Hig^ Throughput Screening (HTS)) for small molecule immiTne 
potentiators (SMIPs), human peripheral blood mononuclear cells (PBMC), 500,000 per tuL in 
5 RPMI 1640 medium with 10% PCS, were distributed in 96 well plates (100,000 pra- well) 

already containing 5pM of conqiound in DMSO. The PBMCs were incubated for 18 h at 37°C in 
5% CO 2 . Their ability to produce cytokines in response to the small molecule compounds is 
determined using a modified sandwich ELIS A. 

Briefly supernatants from the PBMC cultures were assayed for secreted TNF using a 

10 primary plate bound antibody for capture followed by a secondary biotinylated anti-TNF 
antibody forming a sandwich. The biotinylated second antibody was then detected using 
streptavidin-Europium and the amount of bound europium was determined by time resolved 
fluorescence. SMTP compounds were confirmed by their TNF inducing activity that was- 
measured in the ^say as increased Europim counts over cells incubated in RPMI medium alone. 

15 “Hits” were selected based on their TNF-inducing activity relative to an optimal dose of 

lipopolysaccaride LPS (1 pg/ml), a strong TNF inducer. The robystness of the assay and low 
backgrounds allowed for the routine selection of hits with ~10% of LPS activity that was 
normally between 5-lOX background (cells alone). Selected hits are then subjected to 
confirmation fra: their ability to induce cytokines from multiple donors at decreasing 

20 concentrations. Those compounds with consistent activity at or below 5pM are considered 
confirmed for the purposes of this assay. The assay is readily modified for screening for 
compounds effective at higher or lower concentrations. 

Method 2 

25 Each of the compounds in the above Table 34 elicited TNF-a production in human 

peripheral blood mononuclear cells. Many of the compounds showed activity at less than 20 pM 
with respect to production of TNF-a Many of these compounds showed activity at less than 5 
pM with respect to production of TNF-a Many of these compounds showed activity in the 
production of TNF-a at less than 1.5 pM. 

30 For this reason, each of the R groups of any of the compounds listed in Table 34_are 

preferred. Additionally, because of the excellent activity of each of the compounds, each of 
these compounds is individually preferred and is preferred as a member of a group that includes 
any or aU of the other compounds and each compound is preferred in methods of modulating 
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immunopotentiation and in methods of treating biological conditions associated thomwith, for 
example to be used as a vaccine adjuvant Each of the compounds is also prefaced for use in 
preparation of medicaments for vaccines, immunopotentiation, reducing tumor growth and in 
treating biological conditions mediated therefrom. 

5 In additon to the procedure described above, methods of measuring other cytokines (e.g. 

ILl-beta, IL-12, IL-6, IPN-gamma, IL-10 etc.) are well known in the art and can be used to find 
active SMIP compounds of the present invoition. 

Compounds may be useful that cause production of TNF-a at higher concentrations, such 
as lOOpM, 200 pM or 300pM in the assays described herein. For example Loxoribine causes 
10 useful production of TNF-a at 300pM (see Pope et al Immunostimulatory Compound 7-ADyl-8- 
Oxoguanosine (Loxoribine) Induces a Distinct Subset of Murine Cytokines Cellular 
Immunology 162: 333-339 (1995)). 

The subject invention also includes isotopically-labeled antiviral compounds, that are 
15 structurally identical to those disclosed above, but for the fact that one or more atoms ate 

replaced by an atom having an atomic mass or mass number different from the atomic mass or 
mass number usually found in nature. Examples of isotopes that can be incorporated into 
antiviral compounds of the invention include isotopes of hydrogen, carbon, nitrogen, oxygen, 
phosphorous, sulfur, fluorine and chlorine, such as ^H, ’^C, ^'^C, ^P, 

20 '®F and ^Cl, respectively. Antiviral compounds of the present invention, derivatives thereof, and 

phanhaceuticaliy acceptable salts of said compounds and of said derivatives that contain the 
aforementioned isotopes and/or other isotopes of other atoms are within the scope of this 
invention. Certain isotopically-labeled antiviral compounds of the present invention, for example 
tho^ into which radioactive isotopes such as ^ and ^^C are incorporated, are useful in drug 
25 and/or substrate tissue distribution assays. Tritiated, i.e., and carbon- 14, Le., ^'^C, isotopes are 
particularly preferred for their ease of preparation and detectability. Further, substitution with 
heavier isotopes such as deutaium, ue., ^EL, may afford certain therapeutic advantages resulting 
from greater metabolic stability, for example increased in vivo half-hfe or reduced dosage 
requirements and, hence, may be preferred in some circumstances. Isotopically labeled antiviral 
10 compounds of this invention and derivatives thereof can generally be prq>ared by carrying out 
known or referenced procedures and by substituting a readily available isotopically labeled 
reagent for a non-isotopically labeled reagent 

hi accordance with the present invention, methods are provided for the administration of an 
effective amount of a SMIP compoimd to act as an adjuvant Also provided are immunogenic 

J5 compositions comprising a SMIP compound, an antigen, and optionally other adjuvants. 
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As adjuvants, the SMIP compounds are combined with antigens and delivery systems to 
form a final immunogenic composition or vaccine product. 

As immunotherapeutics, the SMDP compounds are used alone or in combination with 
other therapies for treatment of SARS. 

5 Those of ordinary skill in the art will recognize that physiologically active antiviral 

compounds, SMIPs or SMISs that have accessible hydroxy groups are frequently administered in 
the form of pharmaceutically acceptable esters. The antiviral compounds of this invention can 
be effectively administoed as an ester, formed on the hydroxy groups, just as one skilled in 
pharmaceutical chemistry would expect It is possible, as has long been known in 
10 pharmaceutical chemistry, to adjust the rate or duration of action of the antiviral compound by 
appropriate choices of ester groups. 

Other compounds that can be used in combination with the therapeutic agents described - 
hacein include, pentoxifylline (PTX), methylprednisolone, tiimetrexate (Neutrexin), Zadaxin 
(thymosin alpha 1), optionally sub^tuted 5-aniinomethiiiiiiiino-3-methyl-4-isoxazolecarboxylic 
15 acid phenylamides, cyclosporine A (CsA), 6-oxo-l,4,5-thiadiazin[2,3-h]quinazoline, 3-amino- 
2(lH)'thioxo-4(3fl)“<luiDazolinone, gangdclovir, glycyrrhizin, tetracyclines, aminoglycosides, 
quinolones,bicyclam(l,4-Bis(l,4,8,ll-tetraazacyclotetradec-l-ylmethyl)benzene 
octahydrochloride dihydrate), rapamycin, wortmannin, enalapril, roquinimex/linomide,' 
inactivin, DNCB, AG7088, 9-aminocamptothecin (CPT-11) , loxorobine, bropiiimine, Ononase 
20 (D (ranpimase), statins, such as: lovastatm— Mevacor®, pravastatin—Pravachol®, simvastatin— 

Zocor®, fluvastatin— Lescol®, atorvastatin — lipitor® and rosuvastatin— Crestor®. . ’’ 

As used herein, the term "effective amount" means an amount of antiviral compound of the 
conqmsitions, kits and methods of the presmit invention that is capable of treating the sympt oms 
of the described conditions. The specific dose of a compound administered according to this 
25 invention will, of course, be determined by the particular circumstances surrounding the case 
including, for example, the compound administered, the route of administration, the state of 
being of the patient, and the severity of the condition being treated. 

Hie dose of an antiviral compound of this invention to be administered to a subject is 
rather widely variable and subject to the judgment of the attending physician. It should be noted 
10 that it may be necessary to adjust the dose of a compound when it is administered in the form of 
a salt, such as a laureate, the salt forming moiety of which has an appreciable molecular weight 
The following dosage amounts and other dosage amounts set fo^ elsewhere in this 
description arc for an average human subject having a weight of about 65 kg to about 70 kg. ITie 
skilled practitionar will readily be able to determine the dosage amount required for a subject 
15 whose weight falls outside the 65 kg to 70 kg range, based upon the medical history of the 
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subject and the presence of diseases, e.g., diabetes, in the subject Calculation of the dosage 
amount for other forms of the free base form such as salts or hydrates is easily accom plishe d by 
performing a simple ratio relative to the molecular weights of the species involved. 

In g^eral, the pharmaceutical compositions will include at least one antiviral compound 
5 in combination with a pharmaceutically acceptable vehicle, such as saline, buffered saline, 5% 
dextrose in water, borate-buffered saline containing trace metals or the like. Formulations may 
further include one or more excipients, preservatives, solubilizers, buffering agents, lubricants, 
fillers, stabilize, etc. Methods of formulation are well known in the art and are disclosed, for 
example, in "Remington's Pharmaceutical Sciences," Mack Pub. Co., New Jersey (1991) or 
10 “Remington: The Science and Practice of Pharmacy,” 20* ed. , Lippincott Williams & Wilkins, 
Baltimore, Maryland (2000), incorporated herein by reference. 

Pharmaceutical coropositions for use within the present invention can be iii the form of 
sterile, non-pyrogenic liquid solutions or suspensions, coated capsules, suppositories, lyophilized 
powders, transdenhal patches or other forms known in the art. 

15 Many of the active ingredient antiviral compounds are known to be absorbed finm the 

alimentary tract, and so it is usually preferred to administer a compound orally for reasons of 
convenience. However, the compounds may equally effectively be administered intravenously, 
subcutaneously, percutaneously, or as suppositories for absorption by the rectum or vagina, if 
desired in a given instance. of the usual types of compositions may be used, including 
20 tablets, chewable tablets, capsules, solutions, parenteral solutions, troches, suppositories and 
suspensions. Compositions are formulated to contain a daily dose, or a convenient fraction of 
daily dose, in a.^sage unit, that may be a single tablet or capsule or convenient volume of a 
liquid. 

Capsules am prepared by nuxing the compound or compounds with a suitable diluent and 
25 filling the proper mount of the mixture in capsules. The usual diluoits include inert powdered 
substances such as starch of many different kinds, powdered cellulose, especially crys tallin e and 
microcrystalline cellulose, sugars such as fmctose, manni tol and sucrose, grain flours and similar 
edible powders. 

Tablets are prepared by direct compression, by wet granulation, or by dry granulation. 

30 Their formulations usually incorporate diluents, binders, lubricants and disintegrators as well as 
die conqiound or compoimds. Typical diluents include, for example, varioos types of starch, 
lactose, mannitol, kaolin, calcium phosphate or sulfate, inorganic salts such as sodium chloride - 
and powdmd sugar. Powdered cellulose derivatives are also useful. Typical tablet binders are 
substances such as starch, gelatin and sugars such as lactose, fructose, glucose and the like. 

35 Natural and synthetic gums are also convenient, including acacia, alginates, metbylcellulose, 
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polyvinylpynolidine and tiie like. Polyethylene glycol, ethylcellulose and waxes can also serve 
as binders. 

A lubricant is generally necessary in a tablet formulation to prevent the tablet and punches 
from sticking in the die. Hie lubricant is chosen from such slippery solids as talc, magnesium 
5 and calcium stearate, stearic acid and hydrogenated vegetable oils. 

Tablet disintegrators are substances that swell when wetted to break up the tablet and 
release the compound or compounds. Ihey include starches, clays, celluloses, algins and gums, 
more particulariy, com and potato starches, methylcellulose, agar, bentonite, wood cellulose, 
powdered natural sponge, cation-exchange resins, alginic acid, guar gum, citrus pulp and 
10 caiboxymethylcellulose, for example, may be used as well as sodium lauryl sulfate. 

Tablets are often coated with sugar as a flavor, and sealant, or with film-forming protecting 
agents to modify the dissolution properties of the tablet liie compounds may also be fonnulated 
as chewable tablets, by using relatively large amounts of pleasant-tasting substances such as 
mannitol in the foimulation, as is now well-est^lished in the art 
15 When it is desired to administer a compound as a siq)pository, the typical bases may be 

used. Cocoa butter is a traditional suppository base, that may be modified by addition of waxes 
to raise its melting point slightly. Water-miscible suppository bases comprising, particularly, 
polyethylene glycols of various molecular weights are in wide Use. 

The effect of the compounds may be delayed or prolonged by proper formulation. For 
20 example, a slowly soluble pellet of the compound may be prepared and incorporated in a tablet 
or capsule. The technique may be improved by making pellets of several different dissolution 
rates md filling c^ules with a mixture of the pellets. Tablets or capsules may be coated with a 
film that resists dissolution for a predictable period of time. Even the parenteral preparations 
may be made long-acting by dissolving or suspending the compound or compounds in oily or 
25 emulsified vehicles that allow dispersion slowly in the serum. 

The combinations of this invention may be administraed in a controlled release formulation 
such as a slow release or a fast release formulation. Such controlled release formulations of the 
combination of this invention may be prepared using methods well known to those skilled in the 
art. The method of administration wfil be determined by the attendant physician or other person 
W skilled in the art afto: an evaluation of the subject’s condition and requirements. 

The term “prodrug” means compounds that are transformed in vivo to yield an antiviral 
compound of the preset invention. The transformation may occur by various mechanisms, such 
as through hydrolysis in blood, A good discussion of the use of prodrugs is provided by T. 
Ifiguchi and W. Stella, “Pro-drugs as Novel Delivery Systems,” Vol. 14 of the A.C.S. 

>5 Symposium Series, and in Bioreversible Carriers in Drag Design, ed. Edward B. Roche, 

American Pharmaceutical Association and Pergamon Press, 1987. The term, “prodrug” also 
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encon^asses mutual piodrugs in which one or more antiviral compounds are combined in a 
single molecule that may then undergo transformation to yield the individual antiviral 
compounds of the present invention. 

For example, if an antiviral compound of the present invention contains a carboxylic acid 
5 functional group, a prodrug can comprise an ester formed by the replacement of the hydrogen 

atom of the acid group with a group such as (Ci-C8)alkyl, (C 2 -Ci 2 )alkanoyloxymethyl, 1- 
(alkanoyloxy)ethyl having from 4 to 9 carbon atoms, l-methyl-l-(alkanoyloxy)-ethyl having 
from 5 to 10 carbon atoms, alkoxycarbonyloxymethyl having from 3 to 6 carbon atoms, 1- 
(alkoxycaibonyloxy)ethyl having from 4 to 7 carbon atoms, 1-methyl-l- 
10 (alkoxycarbonyloxy)ethyl having from 5 to 8 carbon atoms, N-(alkoxycarbonyl)aminomethyl 
having from 3 to 9 carbon atoms, l-(N-(alkoxycarbonyl)amino)ethyl having from 4 to 10 carbon 
. atoms, 3-phthalidyl, 4-crotonolactonyl, gamma-butyrolacton-4-yl, di-NJ^-(Ci-C 2 )aIkylarriino(C 2 - 
C 3 )alkyl (such as P-dimethylaminoethyl), carbamoyl-(Ci-C 2 )alkyl, NJ*J-di(Ci- 
C 2 )alkylcarbamoyl-(Ci-C 2 )aIkyl and piperidino-, pyrroUdino- or morpholino(C 2 -C 3 )aIkyl. 

15 Similarly,' if an antiviral compound of the present invention comprises an alcohol 

functional group, a prodrag can be formed by the replacement of the hydrogen atom of the 
alcohol group with a group such as (Ci-C6)alkanoyloxymethyl, l-((C]-Q)alkanoyloxy)ethyl, 1- 
methyl-l-((Ci-C6)allcanoyloxy)ethyl, (Ci-C6)aIkoxycarbonyloxymethyl, N-(Ci- 
Q)alkoxycarbonylaroinomethyl, succinoyl, (Ci-C6)alkanoyl, a-amino(Ci-C 4 )alkanoyl, arylacyl 
10 . and a -aminoacyl, or a-arninoacyl-a-aminoacyl, where each a-aminoacyl group is independently 
• selected from the naturally occurring L-amino acids, P(0)(OH)2, -P(0)(0(Ci-C6)alkyl)2 or 
glycosyl (the radical resulting from the removal of a hydroxyl group of the hemiacetal f orm of a 
carbohydrate). 

If an antiviral compound of the present invention comprises an amine functional group, a 
15 prodrug can be formed by the replacement of a hydrogen atom in the aminp. group with a group 
such as R*-carbonyl, R^O-carbonyl, NR^R^'-carbonyl where R* and R^' are each independently 
((Ci-Cio)aIkyl, (C 3 -C 7 )cycloalkyl, benzyl, or R* -carbonyl is a natural a -aminoacyl or natural a- 
aminoacyl-natural a-aminoacyl, -C(OH)C(0)OY* wherein (Y* is H, (Ci-C6)alkyl or benzyl), - 
C(OY^®) Y^^ wherein Y*® is (C1-C4) alkyl and Y*^ is ((Ci-Qi)alkyl, carboxy(Ci-C6)alkyl, 

10 amino(Ci-C 4 )alkyl or mono-N- or di-NJ^-(Ci-C6)allqrlammoalkyl, -C(Y”) Y” wherein Y^ is 
H or methyl and Y^ is mono-N- or di-NJ^-(Ci-C6)alkylamino, morpholino, piperidin-l-yl or 
pyrrolidin-l-yl. 

Hie compositions for use in accordance with the present invention may be formulated in a 
conveiitional maimer using one or more physiologically acceptable carriers or excipients. 

15 Antiviral, SMIP, SMB, or other immunomodulating compounds are prepared or obtained as 

described herein and in the US Patents and published intmnational patent applications listed in 
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Table 1, Table 2, Table 34 and Table 35. The antiviral compounds can be formulated in 
phaimaceutically acceptable compositions suitable for delivery to the lungs. Particular 
formulations include dry powders, liquid solutions or suspoisions suitable for nebulization and 
propellant formulations suitable for use in metered dose inhalers. The preparation of such 
5 formulations is weU know to those sldlled in the art, and is described in US Patent Nos. 

5,814,607 and 5,654,007 and in the US Patents and published international patMit applications 
listed in Table 3 the disclosures of which ate incorporated herein by reference. 

Dry powder formulations wiU comprise an antiviral compound in a dry, optionally 
lyophilized form with a particle size within a preferred range for deposition within the lung. 

10 Typically the particle size for.deposition in the lung will range between 1 and 5 nm. Wheri 

systemic delivery of the antiviral compound via absorption from the lung into the bloodstream is 
desired the antiviral compound formulation particle size is generally between 0.1 and 2 (im in 
■ size; The preferred size range of particles can be produced using methods such as jet-milling, 
spray drying and solvent precipitation, for example. Dry powder devices typically require a 
15 powder mass in the range from about 1 mg to 100 mg to produce an aerosolized dose. Thus, the 
antiviral compoimd will typically be combined with a pharmaceutically acceptable dry bulking 
powder. Preferred dry bulking powders include sucrose, lactose, trehalose, human serum 
albumin (HS A), phospholipids and glycine as weU as those disclosed in the documents listed in 
Table 3. Dry powders can be administered to the subject in conventional dry powder inhalers. 

20 For liquid formulations the antiviral compound can be dissolved in any recognized 

physiologically acceptable carrier for use in delivery of aerosolized formulations. Such carriers 
include buffered and unbuffered aqueous solutions for water soluble compounds, and ■ ^ 
physiological solutions including saline solution ^neferably between 0.2 and 2 N NaQ). For 
antiviral compounds with limited solubility, other liquid vehicles such as ethanol, propylene 
25 glycol and ethanol-propylene combinations may be used. The antiviral compounds may also be 
a dmini stered as soUds in suspension. i 

For administration by inhalation, the compositions for use according to the present 
invention are conveniently delivered in the form of an aerosol spray administered via pressurized 
packs or a nebulize, with the use of a propellant, e.g., air, dichlorordifluoromethane, 

30 dichloroterafluoroethane or other suitable gas. Preferably, for incorporation into the aerosol 
propellant, the antiviral compound formulations of the present invention will be processed into 
respirable particles as described above for the dry powder formulations. Ihe particles are then 
suspended in the propellant, optionally being coated with a surfactant to enhance their 
disbursement In the use of a pressurized aerosol die dosage unit may be determined by 
35 providing a valve to deliver a metered amoimt 
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Commcrcidlly ovsiloblc Jet nebulizers ore ovoileble ond moy be used to deliver oerosolized 
antiviral compound to a subject Such jet nebulizers include, but are not to, those 

supplied by AeroTech 11 (C3S-US, Bedford, Mass.). In addition, for delivery of am)solized 
antiviral compound to the lungs of a subject an oxygen source can be attached to the nebulizer 
5 providing a flow rate of, for example, 10 L/min. In ^neral, inhalation is performed over a 5-40 
minute time interval through a mouthpiece during spontaneous respiration. Tlie preset 
invention provides for novel compositions comprising a suitable carri^ and aerosolized antiviral 
compound in doses suflBcient to reduce or ameliorate viral load and SARS symptoms in subjects 
having SARS. Such doses can be lower than corresponding systemic doses that may be u$ed to 
10 those generally used to reduce or ameliorate viral load and SARS symptoms in subjects having 
SARS. 

The antiviral, SMIP, SMIS, and immunomiodulating conopositions of the present invention 
may be administered with a steroidal anti-inflainmatory drug for the treatment of SARS and 
SARS syn^itoms. Bxamples of steroidal anti-mflammatory drugs of the invention include 
15 hydrocortisone, prednisolone, dexamethasone, triamcinolone acetonide, fluocinolone acetonide, 
fludrocortisone acetate, betamethasone, etc. 

The antiviral con^und composition of the invention is nebulized predominantly into 
particle si^s allowing a delivery of the drug into the terminal and respiratory - bronchioles. For 
efficacious delivery of antiviral compound to the lung endobronchial space of airways in- an 
JO aerosol, the formation of aerosol particles having mass medium average diameter predominantly 
betwerai 1 to. 5 ^m is necessary. The formulation must additionally provide conditions -that 
would not adversely affect the functionality of the airways. Consequently, the formulation must 
contain enough of the drug formulated under the conditions that allow its efficacious delivery 
while avoiding undesi^le reaction. 

15 For liquid solutions and suspensions, the choice of the nebulizer is made from among 

commercially available nebulizers. The jet nebulizers known as Sidestream O, obtained fiom 
Medicaid and Pari LCS, LC Plus, and eFlow obtained from Pari Respiratory Equipment, 
Richmond, Virginia, are examples of typical nebulizers suitable for the practice of the invention. 
Ultrasonic nebulizers that produce appropriate particle sizes of about 1 to 5 nm such as 
iO Aerosonic by DeVilbiss and UltraAire by Omron are also suitable. 

Advantageously, the present invention also provides for a kit for use by a consumer for the 
treatment and/or prevention of SARS. Such a kit comprises: (a) a pharmaceutical composition 
comprising a therapeutically effective amount of at least one compound from among those 
described herein or listed in Table 34 and Table 35 or described in the US Patents and published 
■5 international patent applications listed in Table 1, Table 2, and Table 35 and a pharmaceutically 
acceptable earner, vdiicle or dilurat; (b) a container for holding the pharmaceutical composition; 
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and, optionally, (c) instructions describing a method of using the pharmaceutical compositions 
for the treatment and or the prevention of SARS. Ihe kit may optionally contain a plurality of 
antiviral compounds for the treatment of SARS wherein the anti viral compounds are selected 
from 3C-like protease inhibitors and p^ain-like protease inhibitors. In a further embodiment, 

5 the kit contains an antiviral compoimd which is an RNA-dependent RNA polymerase inhibitor. 
When the kit comprises more than one antiviral compotmd, the antiviral compounds contained in 
the kit may be optionally combined in the same pharmaceutical composition. 

A "kit" as used in the instant application includes a container for containing the s^arate 
compositions such as a divided bottle or a divided foD packet. The container can be in any 
10 conventional shape or form as known in the art that is made of a pharmaceutically acceptable 
mat^al, for example a paper or cardboard box, a glass or plastic bottle or j ar, a re-sealable bag 
(for example, to.hold a "refill" of tablets for placement into a different container), or a blister 
pack with individual doses for pressing out of the pack according to a therapeutic schedule. The 
container employed can dq>end on the exact dosage form involved, for example a conventional 
15 cardboard box would not generally be used to hold a liquid suspension. It is feasible that more 
than one container can be used together in a single package to market a single dosage form. For 
example, tablets.may be contained in a bottle that is in turn contained within a box. 

An example of such a kit is a so-called blister pack. Blister packs are well known in the 
packaging industry, and are bmng widely used: for the packaging of pharmaceutical unit dosage 
20 forms (tablets, capsules, and the like). Blister packs generally consist of a sheet of relatively stiff 
material covered with a foil of a preferably transparent plastic material. During the packaging 
process, recesses are formed in the plastic foil. The recesses have the size and shape of 
individual tablets or capsules to be packed or may have the size and shape to acconunodate 
multiple tablets and/or capsules to be packed. Next, the tablets or capsules are placed in the 
25 recesses accoidihgly and the sheet of relatively stiff material is sealed against the plastic foil at 

the face of the foil that is opposite from the direction in which the recesses were formed.' As a 
result, the tablets or capsules are individually sealed or collectively sealed, as desired, in the 
recesses between the plastic foil and the sheet. Preferably the strength of the sheet is such that 
the tablets or capsules can be removed from the blister pack by manually applying pressure on 
30 the recesses whereby an opening is fonned in the sheet at the place of the recess. Hie tablet or 
capsule can then be removed via said opening. 

It maybe desirable to provide a written memory aid, wh^ the written memory aid is of the 
type containing information and/or instructions for the physician, pharmacist or subject, e.g., in 
the form of numbers next to the tablets or capsules whereby the numbers correspond with the 
35 days of the regimen which the tablets or capsules so specified should be ingested or a card that 
contains the same type of information. Another example of such a memory aid is a calendar 
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printed on the card e.g., as follows "First Week, Monday, Tuesday,” . . .etc “Second Week, 

Monday, Tuesday, . . etc. Other variations of memory aids will be readily q>paient A "daily 
dose" can be a single tablet or capsule or several tablets or c^sules to be taken on a given day. 
Also a daily dose of one or more component(s) of the kit can consist of one tablet or capsule 
S while a daily dose of another one or more component(s) of die kit can consist of several tablets 

or capsules. 

Another specific embodiment of a kit is a dispenser designed to dispense the daily doses 
one at a time in the order of their intended use. Preferably, the dispenser is equipped with a 
memory-aid, so as to further facilitate compliance with the regimen. An example of such a 
10 memory-aid is a mechanical counter that indicates the number of daily doses that has been 

dispensed. Another example of such a memory-aid is a batl^-powered micro-chip memory 
coupled with a liquid crystal readout, or audible reminder signal which, for example, reads out 
the date that the last daily dose has been taken and/or reminds one when the next dose is to be 
taken. 

15 

EXAMPLES 

Example 1-EXAMPLE of a SARS VIRUS ISOLATE 

A SARS virus was isolated from clinical specimens of a patient in Frankfint, Germany 
(FRA). Ihe isolate was grown in Vero cells. RNA of the SARS virus was extracted and: 

20 amplified by RT-PCR. Nucleotide sequence of the viral genome was determined by direct 
sequencing of the PCR product Computer analysis was used to predict the features of the 
genome, to compare it to previously known coronaviruses and to the sequence of different SARS 
virus isolates. ' 

More specifically, isolation and sequence was performed as follows. After the third 
25 passage of the SARS virus in Vero ceUs, viral particles were purified by ultra centrifugation 

from 3x10 cells supernatant Viral RNA was extracted by Triazol method (Gibco-BRL). Viral 
RNA (200 ng) was transcribed into cDNA with avian RNaseH- thermostable reverse 
transcriptase following the instractions of the manufacturer (Th^moScript RT System, 
Ihvitrogen). Briefly, eitiier 50 pmoles of oligo (dT) 2 o (SEQ ID NO; 7389) or 25 ng of random 
30 hexamers were used to prime die RT reaction in a 20 pi final volume. Amplification and 
sequencing of the SARS genome were accomplished by direct sequencing of PCR products 
obtained with: i) specific primers firom conserved regions of homology found through multiple 
alignment among known coronaviruses; ii) oligonucleotides designed around short sequences of 
SARS isolates available on the Web through WHO network laboratories; iii) degenerate primers 
35 to amplify the cDNA mixture with multiple overlapping fragments as end products. Gap closure 
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was lealized by long distance PCR witb high fidelity Taq (Expand High Rdelity system, Roche) 
using primers designed on selected fragments. Sequence was collected by primer walking using 
a Bi^ye tmminator chemistry (Applied Biosystems) and an automated DNA sequencer (3700 
capillary model. Applied Biosystems). After obtaining a first pass of the entire genome, a set of 
5 both forward and reverse primers were used to amplify and sequence de novo the genome using 
as a template DNA segments of 2 kb on avmnge. Readings from overlapping fragments were 
automatically assembled by AutoAssembler (Applied Biosystems) and the 29,740 bp contiguous 
edited manually. 

Computer analysis of the sequence was performed as follows. The GCG Wisconsin 
10 Package suite (version 10.0) was used for computer analysis of gene and protein sequences. Tlie 
PSORT program (h^p:/^sortnibb.ac.jp/) was used for localization predictions. For secondary 
stnicture analysis, the PHD software available on the Web at 

hltp://cubkJ>wc.columbia.edu/predktpwtem/viZS applied. The PSI-BLAST algorithm was used for 
homology searches ihttp:/Amw.rui>Lnlm.mh.govd>lasi) using the non-redundant protein database. 

15 QustalW was applied to obtain multiple sequence alignments of gene and protein sequences. 

The LeamCoil-VMF program was used to predict coded-coil regions in the spike proteins 
{ht^:/AearncoU-vmf.lcs.niiLedu/cgi-bin/vn^. Leucine zippers were predicted with the program 223P, 
available at http://2Zip.molgen.mpg.de. 

Phylogenetic analysis was performed using the neighbor-joining algorithm as implemented 
20 in the program NEIGHBOR within the Phylogeny Inference Package (Phylip) (Felsenstein J 
1993, program distributed by the author). Bootstrap analysis was always performed with 100 
rqplicates using the program Seqboot Trees were handled and displayed using TreeVifew. Tlie 
program HMMER was used to generate sequence profiles from multiple sequence alignments of 
the SI domains of spike proteins. Subsequently, the HMMPFAM program was used to compare 
25 tile SI domain of SARS spike to the profiles. 

Hie genome of this SARS virus isolate is 29,740 bases long and the overall structure of the 
genome is similar to that of the three known groups of coronaviruses. Starting fiom the 5’ end a 
leader sequence, an untranslated region (UTR) and two overlapping open reading finmes coding 
for one polyprotein containing the enzymes necessary for replication can be identified. They are 
30 followed by a region coding for the spike (S), envelope (E), matrix (M), nucleocapsid (N) 

structural proteins and right additional ORFs specific for the SARS virus. At the 3'-end of the 
genome a UTR with a poly(A) is located. Tlie overall homology to coronaviruses groups 1, 2 
and 3 is low and therefore the SARS virus belongs to a new group (group 4) of coronavirus. 
More detailed analysis of the spike protein amino arid sequence shows that the SARS virus 
35 isolate is mote closely related to coronavirus group 2. 
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The complete genome sequence of the SARS virus isolate is 29,740 bp in length. TTie 
sequence is available on Genbank and has a GC content of 40.8%, comparable with that of 
known viruses of the same family. Genome structure is similar to that of other coronaviruses. 14 
open reading frames have been predicted. The principal features of the genome and gene 
5 products are illustrated reported in Kgure 17 and Table 10. The comparison between die SARS 
genome and those of group 1, 2 and 3 coronaviruses is reported in Hgure 18. 

Nucleotides 1-73 contain a predicted RNA leader sequence followed by an untranslated 
region (UTR) of 197 nucleotides. The UTR is followed by two overlapping open reading frames 
(ORFla, ORFlb), which «icompass two-thirds of the genome (nucleotides 265-21485). They 
10 encode for a large polyprotein, which is predicted to be processed by viral proteases to gen^te 
the replicase complex. The 3’ part of the . genome contains the genes coding for the four 
structural proteins (S, spike protein, E, envelope protein, M, matrix glycoprotein, and N, • 
nucleocapsid protein), and ei^t predicted ORFs of unknown fimction (Figure 17). Rn^y, at the 
3’ end of the genome, we found a second UTR ot340 bases followed by a poly(A) tract We ' 
15 identified a putative intergenic (IG) sequence also referred to as transcription-associated 

sequence (TAS), which is a typical feature for coronaviruses. The IG sequence is characterized 
by 6-18 nucleotides present at the 3’ end of the leader and can be found in front of each gene. 
The IG sequence plays a key role in RNA transcription and its regulation. The IG sequence of 
the SARS virus is characterized by the sequence SEQ ID NO: 7293 and is present nine times in 
20 the genome (Rgure 17). The sequence of the leader and IG are peculiar for each coronavirus and 
represent a specific signature for the virus. 

The Replicase Region 

The replicase gene, ORFlab (SEQ ID NO: 7232), consists of two overlapping ORFs, 
ORFla and ORFlb, which can be translated as a single polyprotein by frame shift of the 
25 ribosome in position 13,393, within the polyrherase encoding region. See Brierley et al, Enibo J 
1987: 6(12): 3779-3785. As expected, a stem-loop sequence is present ten base pairs 
downstream of this site (SEQ ID NO: 7390; 

5'- CGGTGTAAGT GCAGCCCG Tl;iTACACCG- 3'V The polyprotein is cleaved co- and/or 
post-translationally into multiple proteins by its own encoded proteases. Using the cleavage 
30 consensus sequence and by analogy with otiier coronaviruses, we have mapped the possible 

cleavage sites of the polyprotein and have identified 14 products, which comprise the leader 
protein p28, the homologue of the MHV p65 protein and other twelve proteins, named from nspl 
to nspl3 (nsp, non structural protein) (Rgure 17 and Table 10). The amin o acid sequence 
analysis suggests the presence of several functional motifs wifliin the putative ORFlab proteins. 
35 In particular, we have mapped two potential proteases (nspl and nsp2), one growth factor-like 

motif (nspT) within ORFla, whereas in ORFlb we identified the RNA polymerase (nsp9), and a 
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predicted helicase (nsplO). Tlie other predicted cleava^ products (nsp3, nsp4, nsp5, nsp6, 
nspl 1, nspl2 and nspl3) are proteins of unknown hmction. Many of these proteins are 
presumably present in the Kt^A replication complex, which is associated with the mem bra nous 
structures in the infected cells. In particular, nsp3 and nsp4 contain hydrophobic domains. As 
5 shown in Hgure 18, the replicase region of SARS has a similar organization to group 1, 2 and 3 
coronaviruses; however, the ovmaU aminoacid consmiration is low (Table 11). Tbe most 
conserved proteins are the polymerase and the helices. 

Nspl is the papain-hke cysteine protease (PLP), which cleaves the first two protein 
products Oeader protein p28 and p65 homologue). Within the nspl of MHV, two domains with 
10 papain-like protease activity (PLPl and PDP2) have been mapped, (Kanjanahaluethai et cd 
(2000) J. Virol 74(17):7911-21) which are also conserved with Bovine, transmittable 
gastrorateritis virus CTGV) and Human 229E coronaviruses. However, by sequence alignment 
with the SARS nspl, we identified only one PLP domain containing the catalytic residues 
Cys833 andHis994. 

15 Nsp2 is the chymotrypsin-picomavirus 3C-like protease (3CLp), which is responsible for . 

the post-translational processing of the other 12 proteins, most of them cleaved at Q/A or Q/S 
sites, ^ebuhr et.al (1999) J. Virol 73(1): 177-85). It also performs autoproteolydc activity. The 
principal catalytic residues are well conserved with other coronaviruses and are located at 
position His41 and Cysl45. Furthmnore, even the conserved aminoacids Tyrl61 and Hisl63, 

20 which are believed to be involved in substrate recognition and to be indispensable for proteolytic 
activity, (Hegyi et cd (2002) J. Gen Virol 83(E*t3):581-593) were found in the sequence of.the 
SARS 3CLp. 

Tbe invention includes the orflab sequence of SEQ ID NO: 9960 and the orfla sequence 
of SEQ ID NO: 9961, including fragments, variants, hornologs, etc. thereof. 

25 The Structural Reeion 

Analysis of the nucleotide sequence at the 3’ part of the SARS genome identified 12 . 
predicted open reading fimnes. They are coded within 8.2 kb and comprise the four structural 
proteins S, E, M and N, common for all coronaviruses and eight predicted ORPs, which ate 
specific for this virus (Rgure 18). SARS-specific IG sequences upstream of most ORFs (Bgures 
30 17 & 18) suggest that most genes are likely to be transcribed independently. Mterestingly, 

sequences identical to the group 2 IG are also present at the end of the RNA leader and in front 
of the Matrix encoding gene and of ORF 10. 

The spike is a type I glycoprotein, which forms the large spikes on the surface of the virion 
and is responsible for receptor-binding and membrane fusion. (Gallagher (2001) Adv Exp Med 
35 Biol A9A'. 183-92). Hie protein is 1255 residues long with 17 predicted N-gJycosylation sites. It 

has a 13aa leader peptide and a 17 aa C-terminal membrane anchoring sequence (1202-1218). 
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Some (MHV, HCoV-OC43, AIBV and BCoV), but not aU (TGV, EBPV, HCoV-229E) 
coronavirus spike proteins are proteolytically cleaved in two subunits, SI and S2. SI is 
supposed to form tbe bulbous head, which stays non-covalently linked to the C-terminal 
membrane anchor. Qeavage is mediated by a basic aminoacid sequence, which resembles the 
5 consensus sequence for a fimn cleavage site. (Garten et.aL, Biochimie 1994; 76(3-4): 217-225). 
However, in case of this SARS virus isolate, we wrae not able to identify such a sequence, 
implicating that the S protein of this SARS virus isolate is unlikely to be cleaved durin g 
maturation. Secondary structure predictions indicated that the global architecture of the spike 
protein is conserved within all known coronaviruses. The SI domain is mainly formed by beta 
10 sheets and likely adopts a globular fold, while in the S2 domain extensive alpha helical regions 
are predicted. In addition, the LeamCoil-VMF program, specifically designed to identify coiled- 
coil-like regions in viral naembrane-fusion proteins, predicts two coiled-coils within S2, spanning 
aminoacids 900-1005 and 1151-1185, respectively (Figure 19). Both coiled-coil regions contain 
a leucine-zdpper motif, which is also present in the spikes of all coronaviruses. Leucme zippers 
15 are known to promote protein oligomerization; since the spike proteins of TGV and MHV form 
hetero-trimers, (Delmas etal,J Virol 1990; 64(ll):5367-5375) (Godeke, et al., J Virology 2000; 
74(3):1566-1571) it is conceivable that in SARS leucine zippers play a role in promoting and/or 
stabilizing a similar quaternary structure. The spike protein plays a major role in the biology of 
coronaviruses because the S 1 domain contains the receptor-binding domain and the virus 
20 neutralizing epitopes, while the S2 domain is involved in the process of membrane fusion, which 

is essential for virus infectivity. As expected, multiple sequence ali gnmen t of different spike 
proteins showed a major ^gree of variability within the SI domain, whereas S2 is more 
conserved. 

The envelope protein E is a very short polypeptide of 76 aa, involved in the morphogenesis 
25 of the virion envelope. (Godet et al.. Virology 1992; 188(2):666-675). Computer analysis 
predicts a long transmembrane domain close to the N-terminus and two N-glycosylation sites. 
Hie level of aminoacid similarity with other coronavirases is very low and the best homology is 
with the small ravelope protein of the transmissible gastroenteritis virus (TGV). 

The matrix glycoprotein (M) is a 221-residue polypeptide with a predicted molecular 
30 weight of 25 kDa. Computer analysis predicts a topology consisting of a short aminoterminal 

ectodomain, three transmembrane segments and a carboxyl terminus located at the interior side 
of the viral envelope, hi analogy with, the matrix glycoprotein of TGV, that of the avian infective 
bronchitis virus (AIBV) and that of the hypervirulent MHV-2 strain the SARS M glycoprotein is 
N-glycosylated at the N-terminus. SARS M protein shows highest sinadlarity to group 2 viruses 
35 (Table 11). 



- 288 - 




wo 2004/092360 



PCT/US2004/011710 



Rnally, the nucleocapsid protein N is a 397-iesidue-long phosphoprotem that inteacts 
with viral genomic RNA to form the nucleocapsid. The level of conservation with other 
coronaviruses is low, ranging from 26,9% identity with the HCoV-229E to 37,4% identity to the 
Bovine coronavirus (BcoV) (Table 11). Epitope analysis of the nucleocapsid protein has been 
5 carried out (Li e/ of (2003) Geno Prot & Bioinfo 1:198-206) in which the epito^ site at the C 
terminus of the protein was located as SEQ ID NO: 7394 (amino acids 371-407 of SEQ ID NO: 
6052). 

In addition to the above fundamental proteins, many viruses express a set of other peptides, 
which are generally dispensable for viability, but can influence the infectivity potential of the 
10 virus, (de Haan et al.. Virology 2002; 296(1):177-189). These protdns are generally conserved 

within members of the same serogroup, but differ profoundly among the groups. For this reason, 
they are generally referred to as group-specific proteins (Figure 11). Members of the group 1, 
represented here by HcoV-229E, have two group-spedfic genes located between the S and E 
^nes and sometimes one or two ORFs downstream of the N gene, pr^eding the 3’ UTR region 
15 of the genome. Viruses of the group 2, with MHV as prototype, have two group-specific genes 
(2a and HE) between ORFlb and S, as weD as other two between S and E genes. Hnally, the 
group 3 viruses, represented by the prototype AIB V, have two group-specific genes between S 
and E and other two between the M and N genes. 

■ With the exception of the hemagglutinin esterase HE, for which hemagglutinating and 
20 acetyl-esterase enzymatic activities have been demonstrated, all the other group-specific ORFs 
encode proteins whose role has not yet been established. 

Interestingly, the arrangement of specific genes in the SARS genome is peculiar and the 
predicted ORFs do not display any significant homology with ORFs present in the other 
coronaiviruses, nor with any other known protein from different organisms. Like viruses of the 
25 group 1 and 3, SARS lacks the HE hemagglutinin and does not contain ORFs between the 

ORFlb and the S gene. Furthermore, two predicted ORFs (ORF3 and ORF4) are encoded in the 
region between S and E, and supmmpose for most of their length. ORFS has an IG sequence 2 
bp upstream of the ATG start codon. In contrast to the other groups, SARS contains five 
predicted ORFs in the region between M and N genes. ORF7 is located 10 bases downstream of 
30 the stop codon of M gene, and has an IG s^uence 155 nucleotides upstream from the ATG start 
codon. Similarly, ORF8 and ORFIO present an IG rigjit upstream of their ATG start codons. On 
the other hand, the 5’ aids of ORF9 and ORFl 1 shortly superimpose with the flanking genes, 
and for this reason they do not need an IG to activate transcription. ORF12 totally superimposes 
with the N gene and shares very low homology witfi a 22kDa protein of the MHV virus, coded in 
35 the corresponding region. 
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Despite the absence of indications of possible localization and function deriving from 
sequence sinoilarity, ORF3, ORF7 and ORE^ contain hydrophobic segments, sug^sting 
association with membrane stmctuies. hj addition, ORF3, the longest among the SARS specific 
gene, is the only one that racodes for a peptide containing a high number of predicted O- 
5 ^ycosylation sites (Table 1 1). Predicted N-glycosylation sites have been identified in ORF3, 

ORFll andORF12. 

Two shorter ORFs in the non-stmctural regions are SEQ ID NOS: 9965 and 9966. The 
inventioii includes polypeptides with these sequences, and also fragments, variants, etc. 
Phvloeenetic analysis 

10 The substitution frequency within 922 conserved bases from the pol gene of eleven 

coronaviruses from the three differmt sraogroups has been used in the past to show that the 
variability within members of each serogroup is much sinaUra than between members of 
different serogroups, confirming the previously described serological groupings. (Stephensen et 
a/., "Virus Res 1999; 60(2):181-9). We used the 922 bp region of the pol gene of SARS and 
15 aligiied it with the same fragment from other 12 coronaviruses. The tree obtained showed that 
the SARS virus is distinct from the other three groups of coronaviruses (Rgute 20). Similar 
results were obtained using the fuU-length aininoacid sequences of pol. Sd^protease and 
helicase fiem the rephcase region and those of the spike and the matrix glycoproteins fiem the 
stmctural region (data not shown). These data confiimed that the entire genome of the SARS 
20 virus clusters ini' a new group (group 4) of coronavirus. 

To gain more resolution for possible evolutionary relationships we performed the' analysis 
using consMisus Sequences of predicted domains of the proteins, fii particular, we generated 
consensus sequences of the SI domain of the spike protein from the group 1 and group 2 and 
then we compared them to the SI domain of the SARS spike. No consensus could be generated 
25 from the group 3 since only the spike protein of AIBV is known. Interestingly, the tree 

constructed from the alignment of SARS SI with the consensus gmierated from the two groups 
of spike proteins was different from that in Rguie 20, and showed a much closer relationship 
between SARS and group 2 coronaviruses (Figure 21A). Further analysis showed that 19 out of 
the 20 cysteines present in the SARS SI domain are spatially conserved with the group 2 
30 consensus sequence, while only five are maintained either within the group 1 and group 3 

sequences (Rgure 21B). Given the fundamental role played by cysteines in protein folding, it is 
likely that the SI domain of SARS and group 2 coronaviruses share a simil ar spatial 
organization. 
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Sequence variability between SARS coronaviruses 

We compared the FRA sequence to the four complete SARS genomes available on the 
Web. A total of 30 mutations were detected. Nine of these mutations were silent while 21 
resulted in aminoadd substitutions (Table 12). Within ORFla, three silent and seven productive 
5 mutations were detected. In ORFlb, there were 6ve silent and three productive mutations. One 

of the productive mutations was caused by two nucleotide substitutions resulting in a single 
aminoadd change. Five changes were located in the spike protein, four of diese were productive 
and one sUmiL Two productive mutations were in ORF3 and in the matrix glycoprotein M. One 
productive mutation each was in ORFIO and in the nucleocapsid protein N. 

10 The overall difference between FRA and TOR2 was of nine nucleotides resulting in two 

silent mutations and seven aminoacid changes. The difference between FRA and Utbani is 12 
nucleotides, which result in five silent mutations and seven aminoadd changes. For CUHK 16 
nucleotides wmo differrait, five of which were silent mutations. For FRA and HKU 14 nucleotide 
changes resulted in four silent and nine productive mutations. 

15 EXAMPLE 2 -Production, Inactivation and Purification of Whole SARS Virus Using MCS 

Chronudography Resin Purification Followed by Density Gmdient VUracentrifugation 
A SARS isolate FRAl (EMBL: AY310120) was passaged on VERO cells that were 
cultivated in DMEM (Gibco: Cat No. 21969-035, Lot No. 3078864), Penicillin/Sttep (Gibco: Cat 
No. 15070-063, Lot No. 1120042), and 3% FCS (Gibco: Cat No. 10270-106, Lot No. 

20 40F6130K) at 37°C, 5% CO 2 . Trypsin (Gibco: Cat No. 25300-054, Lot No. 3078729) was used 

for detaching the cells. 

For virus production the third passage was used for inoculation of VERO cells at a moi of 
~0.1. Cells were incubated with the virus for 1 h at 37°C in infection medium (DMEM without 
PS, FCS); after Ih cells were washed twice and further incubated at 37°C for 48 h in the presents 
25 of 3% FCS and antibiotics. The supernatant was harvested 48 hours post infection (p.i.) and 
precleared by centrifugation at 3000 rpm at 4®C for 10 min, 

The SARS virus was inactivated by p-propiolactone (BPL) treatment (1:2000) for 18 h at 
4“C, followed by 3 h at 37°C. Testing the virus on successful inactivation, VERO cells were 
incubated with 10 ml BPL treated supernatant for 4 days at 37°C; subsequendally, the 
30 supernatant was transferred to a fresh VERO cell culture and further incubated for another 4 
days. Cells were checked for cytopathic effect (CPJ^. 

200 ml of die BPL-inactivated SARS virus harvest was then clarified using a 0.65 /tm- 
pore-size filter (47 mm diameter) to pass virus particles and retain cell debris. The filter unit was 
coimected to a Masterflex pump, which accomplished a consistent flow rate of 40 ml/min. 
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A MCS Chrvmatoeraphy Purification Step 

The filtered virus suspension was dien subjected to MCS chromatography. The MCS 
column was prepared as follows. 27 ml slurry led to 14 ml sedimentated resin which was packed 
using a Gotec Superformance Column (diameter 1.0 cm, hei^t 15.7 cm, volmne 12.33 ml). 1% 

5 of the column volmne of a 1% acetone solution was injected to the column and the column was 

run with a flow of 100 cm/h. The HETP, N and As values were then calculated as HETP: 0,056 
cm, N / m; 1790 and As = 1.20. 

The amount of proteins in the purified solution after the MCS chromatography step were 
assessed with a bicinchoninic add (BCA) method (Interchim) {see, e.g., 

10 ht^://www.piercenetcom/fUes/bca.pi^ and electrophoresis. 

SDS-PAGE was done in accordance to Laemmli, Nature (1970) 227:680-685. Sarc^les for 
SDS-PAGE were diluted to a protein concentration of 77 /ig/ml. Different protein 
concentrations, were loaded depaiding on the gel types used (10/12/15 Wells, Novex/lhvitrogen): 



Number of Wdls 


Protein Concentration in the Dilution 


Load 


Protdn/Well 


10 Wells 


77 fig/nd 


20 Ml 


1/ig 


12 Wells 


11 


15-20/11 


0.75 -1/ig 


15 Wells 


77 


10 /d 


0.5 /tg 



15 Samples for use in a redudng SDS-PAGE were prepared as follows: 





26 sample or diluted sample 




+ 10 III NuPage Sample Buffer (4x) SDS NP0003 




+ 4 /il TCEP Bondhreaker Solution 77720 
(1:2 in MilliQ water) 


Final V<dume: 


40 III 



The samples were heated for 10 minutes at 70°C or left at room temperature for 1 hour 
(leaviiig the samples at room temperature prevents the M protein of Corona Virus to 

10 coagulate/forming complexes), and then centrifuged for approximately one minute at 14,000 rpm 
in a table top centrifuge. 

Mark^ for use on the gel were prepared as follows. Gel bands containing less than 1 fig 
of proteins were easily visualised with the silver staining procedure using the SUver Staining Eat 
Protein, Plus One Staining Protocol (Pharmacia Biotech). 

15 Western blotting was performed as follows. A semi-dry blotting technique was used to 

transfer the proteins from the SDS gel to a nitrocellulose membrane. The transfer was performed 
with a current of 0.8 mA/cm^ for 1 hour. A rabbit polyclonal antibody against SARS virus was 
used to perform the immimo probing using the Western Bre^e, Novex Chromogenic Western 
Blot Immunodetection Kit (Novex/Invitrogen). 

JO The chromatogram of the inactivated SARS MCS capture step is depicted in FIGURE 27. 

To estimate purity, MCS chromatography fractions were analysed by silver staining on NuPage 
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10% or 4-12% Bis-Tiis-Gel (Novex) under reduced conditions, heated for 10 minntftg at 70“C 
(Rgure 28). The fractions were also analysed under the same conditions by weston blot (Rgure 
29) to estimate purity, using PAK 11/03 SARS Cov 270603 neutralizing titer 1:512 (this 



antibody was used for this and subsequent western blots). Purity estimates are as follows: 



Sample 


yolnme 

/ml 


[Protein] 


Total Protein 


Step Recovery 
Protein/ % 


Corona Harvest 


100 


2547.6 


254.76 


100 


After nitration = Load 


100 


2440.3 


244.03 


95.8 


Flow Through 


85 


2321.4 


197.32 


njs 


Wash 


49.32 


468.5 


23.11 


9.1 


Peakl 


12.12 


252.7 


3.062 




Total Recovery 


- 


- 


464.4 


86.5 



B. Density Gradient Ultracentrifu 2 ation Step 

ITie eluted SARS virus fiiaction was then subjected to density gradient ultracentrifugation 
with a swinging bucket rotor to further purify the inactivated virus. 3 ml of MCS peak fraction 
were.loaded onto a linear gradierir(i5-60% sucrose; 17 ml 15% and 17 ml 60% sucrose in 
10 gradient mixer). The separation was performed with a Beckman SW 28 rotor at 20,000 ipm for 
2 hours. 

The content of sucrose and protein in the. linear density gradient ultracentrifugation 
fractions are depicted in the following table, the graph in figure 30 and the estimation of purity in 
figure 31: 



Fraction 


Fraction / ml 


[Sucrose] / % 


rProtein] / iivJvd 


1 


2 


61 


96.12 


2 


2 


59.4 


98.62 


3 


2 


57.5 


87.63 


4 


2 


54.5 


86.91 


5 


2 


50.5 


79.9 


6 


2, 


47.2 


74.3 


7 


2 


43.7 


68.05 


8 


2 


40.2 


60.43 


9 


2 


37.2 


57.38 


10 


2 


34 


53.12 


11 


2 


30 


50.63 


12 


2 


25.7 


35.02 


13 


2 


22.4 


35.33 


14 


2 


19.5 


39.25 


15 


2 


15.5 


69.79 


16 


2 


8.5 


169.03 


17 


2 


8.5 


128.96 



Tlie protein concentration of fraction 11 (Figure 31 SDS-gel) was measured again against a 
standard curve prepared in 30% sucrose and lead to a protein concentration of 3.67 /ig/ml (0.05 
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jig on the gel). The M protein q)pears to be nussing in fliis preparation possibily due to sample 
treatment procedure (heated samples). 

Tliere may be discrepancies in the protein concentration measurements in Table 2 due to 
sucrose interference with this assay. 

5 EXAMPLE 3 -Production, Inactivation and Purification of Whole SABS Vin^ Using MCS 
Chromatography Resin Purification Followed by Density Gradient Ultracentrifugation 
Inactivated SARS virus was prepared as described in Example above. 

A MCS Chromatography Purification Step 

In this example, 200 ml of inactivated SARS virus harvest were subjected to MCS 
10 chromatography. The chromatogram of the capture step of inactivated SARS virus purification 
with MCS is depicted in FIGURE 32, the protein recovery in the following table and the 



estimation of purity in FIGURE 33: 



Sample 


Volume 

/ml 


[ft*otein] 
/ ne/ml 


Total Protdn 
/mg 


Step Recovery 
Protdn/ % 


Corona Virus Harvest 


200 


2239.2 


447.83 


100 


After Filtration = Load 


200 


2245.1 


449.02 


100.3 


Flow Through 


185 


2126.3 


393.37 


87.8 


Wash 


49.32 


450.1 . 


22.2 


5.0 


Peakl 


4.43 


1245.6 


5.52 


1.2 


Total Recovery 


- 




421.08 


93.7 



B. Density Gradient UltracentrifUsation Step 

15 3.5 ml of MCS peak firaction were then loaded onto a linear gradient (15-40% sucrose: 16 

ml 15% and 16ml 40% sucrose in gradient mixer). The separation was performed with a 
Beckman SW 28 rotor at 20,000 rpm for 2 hours. 

The content of sucrose and protein in the linear density gradient ultracentrifugation 
fiactions are depicted in the following table and the graph in FIGURE 34: 



Tube 


Fraction Size / ml 


[Sucrose] / % 


[Protein] / ushni 


1 


2 


40 


45.86 


2 


2 


39 


45.68 


3 


2 


37.5 


44.14 


4 


2 


35.5 


37.82 


5 


2 


33.5 


34.48 


6 


2 


31.5 


31.76 


7 


2 


30.5 


29.49 


8 


2 


28 


30.87 


9 


2 


25.5 


31.7 


10 


2 


23.5 


26.74 


11 


2 


21.75 


. 23.58 


12 


2 


20 


35.33 


13 


2 


18 


96.38 


14 


2 


14.5 


523.79 
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15 


2 


8 


941.97 


16 


2 


8 


696.7 



Protein recovery is shown in the following table and the estimation of purity is shown in figure 
35. Electron Mcrograph pictures of density gradient fiactions 8 , 9 and 10 are shown in figure 



36; 



Step 


Vohune/ml 


Protein /{ts/ml 


Total Protein/nie 


Step Protein % 


Load 


3.5 ml 


1245.6 


4359.6 


100 


Bulk Protein Practions 


3.5 ml 


720.8 


4324.9 


99.2 


Viral Peak Fraction 


8ml 


29.7 


237.6 


ss 


Total Recovery 


• 




4562.5 


104.7 



5 

EXAMPLE 4 - Mouse Immuniz/ation with Inactivated SARS Virus 

Mice were immunized subcutaneously on days 0, 14, and 28 with 5 pg BPL-inactivated 
SARS-CoV particles (BPL-SARS-GoV), either alone or together with Alum or MF59 as 
adjuvants. Serum w^ collected on days 0 (pre-immunization), 13 (post 1st immunizati on), 28 
10 (post 2 nd), and 35 (1 week post 3rd immuniz ation). Neutralizing antibodies were assessed for 
blocking SARS-CoV infection of Vero cells in vitro. After 3 immunizations, neutralization titers 
were in the range 1:100-1:1000, which are levels s imil ar to Aose present in the serum of SARS 
convalescent patients. As shown in the following table, the non-adjuvanted vaccine induced 
neutralizing antibody after the third immunization, and potency of this vaccine was enhanced 
15 significantly by including the adjuvants, with neutralizing antibody appearing after then 2 nd 
immunization and overall titers increasing after then 3rd immunizati on: 



Immunogen 


Neutralization Titer I 


pre 


post 1st 


post 2nd 


post 3rd 


BPL-SARS-C 0 V+MF 59 (5 Mg) 


<1:20 


<1:20 


1:158 


1:630 


BPL-SARS-CoV+Alum (5 Mg) 


<1:20 


<1:20 


1:67 


1:612 


BPL-SARS-CoV(5Mg) 


<1:20 


’< 1:20 


<1:20 


1:71 


PBS 


<1:20 


<1:20 


<1:20 


<1:20 



EXAMPLE 5 - BaJb/cMouse Immunizfftion with Inactivated SARS Virus 

A Balb/c mouse model for SARS infection has been developed (Subbarao et al. (2004), 

20 J.ViroL, 78:3572-77. In this model, Balb/c mice are inoculated intranasally with 10 '* TCID 50 of 
virus. At 48 hours post-inoculation, a 2-log increase in the TCID 50 virus titor can be detected in 
the lungs of infected mice. While virus replication is readily detected, the mice do not show any 
SARS disease syrr^toms and spontaneously clear the virus one week after inoculation. A 
decrease in virus titer in previotisly-immunized animal s as compared to control animals 
25 demonstrates a protective effect of the vaccine being evaluated. 

In this example, four Balb/c mice per group are inomunized three times widi 5 pg BPL 
inactivated SARS-CoV (days 0, 14, 28) either alone or in combination with MF59 and 
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chaUenged with 10^ TCID50 of SARS-CoV on day 43. Two days foUowing virus challenge the 
mice are eudianized and SARS-CoV is quantified from nasal turbinates (NT) and lungs and the 
mean virus titer for each mouse is measured. Control groups received PBS alone, or an inflnf»n7.a 
virus vaccine (FLU) with or without MF59 adjuvant Data were as follows (see also Rgure 51), 

5 where four mice were tested per group and vims titers are expressed as logio TCTOso per gram of 
tissue: 



Immanogen 


Virus replication in hmgs of 
challenged mice 


Virus replication in nasal 
turbinates of challenged mice 


# infected/ 
#tested 


Mean (±SE) 
virus titer 


# infected/ 
#tested 


Mean (± SE) 
vims titer 


PBS 


4/4 


6.3 ±0.3 


3/4 


2.8 ±0.35 


MF-59 alone 


4/4 


6.1 ±0.13 


3/4 


3.0 ±0.38 


H^U vaccine (5 hr) 


4/4 


6.3 ±0.07 


3/4 


2.9 ±0.36 


FLU vaccine (5 /tg) + MF-59 


4/4 ; 


6.0 ±0.19 


4/4 


3.0 ±0.11 


BPL-SARS-CoV (5 ug) 


‘4 


1.6 ±0.13’ 


0/4 


Not detected ** 


BPL-SARS-CoV (5 /ig) + MF-59 


■ 0/4 


Not detected* 


0/4 


Not detected 



Two-tailed Study’s t-test, compared toPBS-immuoized mice, showed: • P<0.00001 or ** P=0.025 • 



. As shown, virus could not be detected in the BPI^SARS-CoV immunized mice. The lower 
10 limit of detection of infectious virus in a 10% w/v suspension of limg homogenate was 1.5 
logioTClDso/gm, and in a 5% w/v suspension of nasal turbinates the limi t was 1.8 
logioTCIDso/gm. Viral tit^ in the immunized mammals were thus below these threshold values. 

Thus the inactivated SARS-CoV vaccine was very efficient at preventing vims infection, 
as only one of eight mice immunized with the vaccine, either with or without MF59 adjuvant, 

1 5 was infected. Similar protection was not observed in control groups of PBS diluent, MF59 

adjuvant, or influenza virus vaccine with or without adjuvant •• 

Neutralization titers of sera taken from the animal s in the challenge study were assessed at 
two weeks post-lst one week post-2nd, and one week post-3rd immunization. Mice immnniTftH 
with the vaccine with MF59 adjuvant had already developed a neutralization titer of 1 :71 after 
JO the 2nd immunization, which increased to 1:588 after the 3rd immunization, whereas mice 
receiving the unadjuvanted vaccine did not have any neutralizing activity post-2nd and a 
neutralization titer of 1:64 post-3rd immunization. Sera from mice in each of the control groups 
did not show any neutralization activity. These data clearly demonstrate not only the ability of 
the inactivated SARS-CoV vaccine to induce protective levels of SARS neutralizing antibodies, 
15 but also a beneficial effect of formulating the vaccine with adjuvant for elevated neutralization 
titers. 

EXAMPIM 6 - Preparation ofOMV comprising SARS viral antigens 

Ecoli were transfected with a plasmid of interest (encoding a SARS viral antigen). Single 
colonies harbouring the plasmid of interest were grown overnight at 37®C in 20 ml of LB/Amp 
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(100 /tg/ml) liquid culture. Bacteria were diluted 1:30 in 1.0 L of fresh medium and grown at 
eidiCT 30®C or 37°C until the OD550 reached 0.6-0.8. Expression of recombinant protein was 
induced with IPTG at a final concentration of 1.0 mM. After incubation for 3 hours, bacteria 
were harvested by centrifugation at 8 000 x g for IS minutes at 4°C and resuspended in 20 ml of 
5 20 mM Tris-HCl (pH 7.5) and complete protease inhibitors (Boehringar-Mannheim™). AH 

subsequent procedures were performed at 4"C or on ice. 

Cells were disrupted by sonication using a Branson Sonifier 450 and centrifuged at 5 000 x 
g for 20 min to sediment unbroken cells and inclusion bodies. The supernatant, containing 
membranes and cellular debris, was centrifuged at SOOOOg (Beckman Ti50, 29 000 rpm) for 75 
10 min , washed with 20 mM Bis-tris propane (pH 6.5), 1.0 M NaCl, 10% (v/v) glycerol and 

sedimented again at SOOOOg for 75 minutes. The pellet was resuspended in 20mM Tris-HCl (pH 

7.5) , 2.0% (v/v) Sarkosyl, complete protease inhibitor (l.Q mM EDTA, final concentration) and 
incubated for 20 minutes to dissolve irmer membrane. Cellular debris was pelleted by - 
centrifugation at 5000g for 10 min and the supernatant centrifuged at 75000g for 75 minutes 

15 (B^kroan Ti50, 33000 rpm). Outer membrane vesicles were washed with 20 mM Tris-HCl ^H 

7.5) and centrifuged at 75 000 x g for 75 minutes or overnight The OMV was finally 
resuspended in 500 ^tl of 20 mM Tris-HQ (pH 7.5), 10% v/v glycerol. Protein concentration 
was estimated by standard Bradford Assay (Bio-Rad), while protein concentration of iimer 
membrane fraction was determined with the DC protein assay (Bio-Rad). Various fractions from 

20 the isolation procedure were assayed by SDS-PAGE. 

EXAMPLE 7.- Immunogenicity, dose and route schedule for recombinant Spike protein in 
mice 

The inununogenicity, route and dosing of the recombinant spike proteins of the invention 
in mice may be assessed using the below detailed protocol. Preferably, the administered antigen 
25 will elicit neutralizing antibody titers at least in the range of 1/100-1/1000. Increasing rioses of 

antigen can be tested in the range from 5 to 20 ^g of recombinant Spike antigen alone or mixed 
with an equal volume of MF59-citrate, administered SC or IM to anesthetized mice in 100 pi of 
inoculum. Groups of BALB/c mice, 6 p^ treatment are primed at day 0 and boosted at day 14 
and 28. 
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Group 


Treatment 


Dose/Route 


Sampling Interval 


Number of mice 


1-3 


Rec-Spike protein 


20 . 10, 5;/g/SC 


7. 21,35, 42 d 


6 per dose level 


4-6 


Rec-Spike protein 


20, 10,5;/g/SC 


7 


6 per dose level 


7-9 


Rec-SpIke protein 


20, 10, 5//g/lM 


7, 21, 35, 42 d 


6 per dose level 


16-12 


Rec-Spike protein 


20 , 10 , 5/yg/IM 


7 


6 per dose level 


13-15 


Rec-Spike - MF59 


20,10,5^g/SC 


7, 21. 35, 42 d 


6 per dose level 


16-18 


Rec-SpIke - MF59 


20, 10, 5;/g/SC 


7 


6 per dose level 


19-21 


Rec-Spike - MF59 


20, 10, 5;/g/IM 


7, 21, 35, 42 d 


6 per dose level 


22-24 


Rec-Spike - MF59 




7 


6 per dose level 


25 


MF59 


NA/SC 


7, 21, 35. 42 d 


6 - 1-6 (sac d 7 and 42) 


27 


MF59 


NA/IM 


7, 21, 35, 42 d 


6 6 (sac d 7 and 42) 


29 


Saline 


NA/SC 


7, 21, 35. 42 d 


6 - 1-6 (sac d 7 and 42) 


31 


Saline 


NA/1M 


7. 21. 35,42 d 


6-^6 (sac d 7 and 42) 



This protocol can also be used to assess the Thl/Th2 profile of the specific immune 
response elicited by the recombinant Spike protein. Neutralizing and Spike-specific antibody 
tit^ will be assessed at days 7, 21, and 35; ]^2a vs ^G1 isotype of the Spike-specific . 

5 antibodies will be determined at days 21 and 35; in vitro proliferation of lymph node and splenic 
T cells against the recombinant Spike protein will be determined at days 7 and 42, respectively; • 
IFN-y and BL4 production by splenic T cell against the recombinant Spike protein fix>m SARS- 
CoV will be assessed at day 42. Peripheral blood will be collected at days 7, 21, 35; lymph 
nodes cells at day 7, and spleen cells at day 42. Neutralizing and Spike-specific antibody titers 
10 and isotypes will be determined by inhi bition of SARS-CoV infection of Verb cells and by 

ELISA, respectively. Proliferation of lymph node and splenic cells will be determined by ^[H]- 
Thj^dine uptake. Frequencies of splenic IFN-y and EL-4 producing T lymphocytes, will be 
deteniuned by ELISPOT and FACS. 

EXAMPLE 8 -Immunogenidty, dosing and route schedule for Spike protons in rabbits 
15 The immunogoiicity, route and dosing of the recombinant spike proteins of the invention 

in rabbits may be assessed using the below detailed protocol. Increasing doses can be tested in 
the range from 5 to 40 /ig of recombinant Spike antigen alone or mixed with an equal volume of 
MF59-dtrate, administered SC or IM to anesthetized animals in 200pl of inoculum. Groups of 
New Zealand white female rabbits, 10 per treatment, will be immunized as shown in the table 
20 below. The anim als will be primed at day 0 and boosted at days 14 and 28. Peripheral blood 
will be collected at days 7, 21, and 35. Neutralizing and Spike-specific antibody titers wifi be 
determined by inhibition of SARS-CoV infection of Veto cells and by ELISA, respectively. 
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Group 


Tr^tment 


Dose/Route 


Sampling interval 


Number of rabbits 


1-4 


Full-length Spike protein 


40, 20, 10, 5pg/SC 


7, 21,35d 


10 per dose level 


5-8 


FulHength Spike protein 


40, 20, 10, Sipg/IM 


7 , 21, 35 d 


10 per dose level 


9-12 


Truncated Spike protein 


40, 20, 10, 5pg/SC 


7, 21, 35 d 


10 per dose level 


13-16 


Truncated Spike protein 


40. 20. 10, 5//g/IM 


7,21, 35 d 


10 per dose level 


17-20 


FulHength Spike protein - MF59 


40, 20, 10, 5|ig/SC 


7,21, 35d 


10 per dose level 


21-24 


FulHength Spike protein - MF59 


40, 20, 10, 5»/g/iM 


7, 21, 35 d 


10 per dose level 


25-28 


Tmncated Spike protein - MF59 


40. 20. 10, SpgfSC 


7, 21,35d 


10 per dose level 


29-32 


Truncated Spike protein - MF59 


40. 20. 10, SfjgnM 


7 , 21, 35 d 


10 per dose level 


33 


MF59 


NA/SC 


7, 21, 35 d 


10 


34 


MF59 


NA/IM 


7, 21,35d 


10 


35 


Saline 


NA/SC 


7 , 21, 35 d 


10 


36 


Saline 


NA/IM 


7, 21, 35 d 


10 



EXAMPhE 9 - ImmunogerUcity and dose schedule for recombinant Spike in ferrets ' 

The immunogenicity and dosing of the recombinant spike proteins of the invention in 
ferrets may be assessed using the below detailed protocol. Three groups of ferrets, 6 for 
5 treatment, will be immunized with recombinant SARS-CoV Spike protein from CHO cell lines, ' 
alone or mixed with an equal volume of MF59-citrate, administered SC to anesthetized animals 
in 200pl of inoculum. The recombinant Spike protein vaccine will be tested at the dose eliciting 
the highest neutralizing antibody titers in mice at day 35 after the second boost. The animals 
will be primed at day 0 and boosted at day 14 and 28. Peripheral blood wiQ be collected at days . 

10 7, 21 , and 35. Neutralizing and Spike-specific antibodies titers wiU be determined by inhibition 

of SARS-CoV infection of Veto cells and by ELES A, respectively. 



Groups 


Treatment 


Dose/Route 


Sampling interval 


Number of ferrets 


1 &2 


Rec-Spike protein 


Y//gor2Y;ig/SC 


7, 21,35d 


6 


3&4 


Rec-Spike protein + MF59 


Y//g or2Y//g/SC 


7, 21. 35 d 


6^ 


5 


Saline 


NA/SC 


7. 21,35d 


6 



The 3 groups of ferrets, 6 animals per group, used for die immun ogenicity studies above 
can then be used to assess efficacy of the recombinant Spike protein in protecting vaccinated 
15 animals firom infection and/or disease. Anestethized animal s will be challenged two weks after 

the last boost intratracheally with 10 ^ median tissue culture infectious dose unit (TCID 50 ) of the 
SARS-CoV Utah strain. Infection by SARS-CoV will be assessed by taking nasal, faringeal and 
rectal swabs from animals for 20 days after challenge as described (12). The presence of SARS- 
CoV in sample mateials will be assessed by RT-PCR and infection assay of Veto cells. 

20 Animals wiU be monitored for clinical signs of SARS disease by assessing sleeping time, 
temperature, respiratory symptoms, diarrhea, body weight and survival. Protection will be 
determined by the magnitude and duration of virus shedding and by duration and severity of 
disease symptoms and percentages of surviving animals. 
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EXAMPLJS 10: Expression of Spike protein for vaccinotion 

The SARS-CoV Spike glycoprotein was ^pressed in both full-length and truncated forms, 
using the nSh and nShATC pCMVni constmcts described above, both with hexahistidine tagg 
The vector constmcts were evaluated for expression 48 hr after transfection into 293 cells and 
5 COS7 cells. The fuU-length Spike protein (nSh) was detected by western blot only in ceU lysate, 
but not in culture media OFigure 52). 

The majority of SARS-CoV full-length Spike protein was expressed in 
transiently-transfected COS7 cells as a high molecular glycoprotein which ran at 540 kDa in 
non-reducing gels (Rgure 53). The gp540 is heat labile as indicated by the complete dissociation 
10 into monomeiic forms (gpl70 & gpl80) by boiling, but it was resistant to DTT treatment These 
data suggest that the recombinant Spike protein is noncovalently associated into a homotrimer 
(gp540). Ibe presence of Spike protein in homotrimeric association also was confirmed in 
inactivated, purified SARS-CoV virion particles. Analysis of virion proteins by western blot 
under the same condition used for the characterization of recombinant Spike piotdn generated 
15 essentially identical results (Figure 54). 

EXAMPLE 11: Spike protein processing 

In order to characterize Spike protein processing, BHK-21 cells were infected with 
alphavirus replicon particles expressing the Si^-CoV full-length Spike. At 6 hoursr post- 
infection with an MOI of 5, infectedcells were labeled for 1 hr with L-[^^S]methionine/cysteine 
to and chased for up to 4 hours. The [^^S]-labeled spike protein was immunoprecipitated by anti- 
SARS rabbit serum and digested with Endo-H. Both digested and undigested proteins were 
analyzed by SDS-PAGE (4% polyacrylamide). As shown in Figure 55, the full-length ^ike 
protein is synthesized as an Endo-H sensitive high-mannose glycoprotein (gpl70, an ER form) 
fliat undergoes modification to an Endo-H resistant glycoprotein with complex oligosaccharides 

:5 (gplSO, a Golgi form). The conversion of gpl 70 into the ajl80 form takes place within 2 hours 

(Figure 56). 

EXAMPLE 12: High-level protein expression 

To develop a system for rapid expression of protein antigens, DNA transfection of 293 
(human embryonic kidney) cells was used, tp obtain milligram quantities of recombinant 
0 antigen. The most common method for culturing and transfecting 293 cells is in static or 

monolayer cultures. These procedures were modified by performing large-scale transfection of 
293 cells in suspension and expanding the transfected cells in suspension culture-for production 
of secreted or intraceUular proteins. Several initial experiments were performed at the 100- 
milliliter scale cultures to determine optimum conditions, such as number of cells, type of 
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transfecting reagent (EniGENE 6, Lipitoid or RO-1538) and the ratio of DNA to transfection 
reagent Based upon pilot experiments, FuGENB 6 was the best transfecting reagent. 

Ihe kinetics of gene expression was compared to other viral envelope glycoproteins, and 
the data suggest that stable protein expression peaks around 72 to 96 hours post-transfection, 

5 depending upon the gene of interest and then significantly decreases thereafter. Thus, using the 
optimum conditions, the transfection process was scaled from 100 ml to 4 liters. Ihe 4 liter 
culture can be used for rapidly producing 2-10 milligrams of protein antigens. To facilitate 
antigen purification and also maximiz e the yield and recovery of the purified protein, 
transfection conditions w^ optimized by using serum-free medium. 

10 Bulk transfection procedure has been used for the expression of truncated and full-length ' 

Spike antigens. The kinetics of expression for truncated form of the spike protein is presented in 
Figure S6A. Expression of the truncated form of Spike protein peaked around 48 hrs and was 
stable until 72 hrs, therefore the cultures were, harvested at 72 hrs post transfection. 

, Collected .media were concentrated 20X and used for purification of truncated Spike 
15 protein by a very simple purification strategy where the truncated form of the spike was captured' 

on GNA lectin followed by DEAE and ceramic hydroxyapatite col umn chromatography. The 
purified protein was analyzed on SDS-PAGE by silver stain (Figure 56B) and also by western 
blot (Figure 56C). Early efforts were able to purify the truncated form of the spike protein with 
>95% purity and approximately 50% recovery. The molecular mass of the truncated form of the 
20 Spike protein is ^proximalely 170-180 kDa. 

Full-length Spike protein was expressed in 293 cells using the bulk transfection strategy. 
The expression data suggest that, like the truncated form, expression peaked around 48 hrs post- 
transfection and remained stable until 72 hrs. However, contrary to the truncated form and as 
expected, full-length protein is not secreted, but rather is retained within the cells, as shown by 
25 the absence of any signal in western blots of cell culture supernatants. The fuU-length form of the 
protein was purified from Triton X-100 detergent-extracted cells. Full-length Spike protein was 
then captured on GNA lectin, followed by hydroxyapatite and SP chromatography. The 
calculated molecular mass of fuU-length spike protein is approximately 600 kDa, which is close 
to the theoretical mass for the trimer. 

30 EXAMPLE 13: SARS virus seed cultures 

A SARS-CoV reference seed virus propagated only in certified Vero cells will be used for 
the generation of the Master and Working Virus Seeds under GMP. A clinical specimrai from the 
respiratory tract of a patient infected by the SARS-CoV is inoculated onto documented VERO 
cells, with certified culture media. Culture media containing the virus are harvested at 4 days 
35 post-infection and designated Passage 1 (PI). A second round of virus propagation is again 
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performed in certified YERO cells with certified media, by inoculation of 1 ml per T-75 flaslf of 
100 times diluted PI virus. Culture supernatant was harvested at 3 days post-infection and stored 
at -80°C as a P2 reference stock virus, without plaque purification. 

Cell banks of Veto ceUs for further production of SARS-CoV are prepared from specific 
5 cell subsets that have not been used since the emergence of transmissible spongiform 

enephalopathies (e.g. since 1980). A research cell bank of these cells has been prepared using 
specified New Zealand-oiigin fetal bovine serum. Prom this research ceU bank, a Master Cell • 
Bank (MCB) is made under GMP conditions and using only specified and well-controlled media 
and supplements. The cell bank will is tested for absence of adventitious agents according to 
10 q>plicable US, EU, and international guidelines (see Points To Consider ’’Characterization of 
cell lines used to produce biologicals”, FDA/CBER 7/1993; ICH Q5D Draft 6 ’’Cell substrates”, 
Oct.23, 1996; CPMP/ICH/294/95 "Note for Guidance on Quality of Biotechnological Products; 
Derivation and Characterisation of Cell Substrates Used for Production of 
Biotechnological/Biological Products. (Step 4, 16. July 97); WHO final draft ”Requirements for 
15 use of animal cells as in vitro substrates for the production of biologicals” 7.3.1997). ■ 

Tumorigenicity and identity testing is also required for this cell bank. 

The reference virus is plaque-purified and expanded in certified Vero cells in the absence , 
of PCS in order to ^nerate Master arid Working Seeds. Another option to help ensure purity and • 
facilitate the assessment of safety of the Master Seed is to subject the SARS-CoV to pelleting 
20 and resuspension in PBS. The virus suspension is made up to 60% (w/w) sucrose with crystalline • 
sucrose, transfmied to a centrifuge tube and overlayed with 50, 40, 30, and 20% (w/w) sucrose 
solutions in PBS. The gradient is centrifuged for 72h and then fi^tionated. The virus-contaming 
fiaction is diluted and the virions re-peUeted by ultracentrifiigation. RNA from tire virus pellet is 
isolated and transfected into certified Veto cells whereby the “infectious” positive-strand RNA 
25 will lead to the production of infectious virus, which can be plaque-purified and expanded to 
generate alternative Master and Woridng Seeds from purified virus RNA. 

Viral seeds are tested for the absence of adventitious agents (see e.g. 21 CPR Revised as of 
April 1, 1994, § 630.35 Test for safety) and for identity, using a highly-specific neutralizing 
antiserum prepared from an independent source. Safety testing of viral seeds for vaccine 
10 purposes is done routinely by service laboratories. Broad-spectrum PCR testing can be used as 

an addition and/or alternative for testing. 

EXAMPLE 14: Scale-up of virus production and inactivation 

A protocol for the production, inactivation, and purification of inactivated SARS-CoV with 
sufficient structural integrity to elicit protective neutralizing antibody responses in aaimal 
15 models involves: Vero.cells are infected with virus at an M.OX of 0.01 in the absence of PCS 
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and antibiotics; culture medium is collected, cleared by centrifugation, and inactivated with 6PL, 
followed by confirmatory testing for complete inactivation; the inactivated matmial is filtered, 
subjected to MCS-column purification, and further purified by sucrose gradient centrifugation. 

Sevraal modifications and improvements can be developed when adapting this basic 
5 protocol to a lar^r scale for conunercial use. Fiistly, the cell culture and infection process can be 
adapted to roller bottles, as an intermediate step to allow rapid production for preliminary trials 
within existing BSL 3+ facilities. Full commercial production will typically use a fermentation 
process in a closed system, but a roller bottle system can be achieved more rapidly. The roller 
bottles do offer a true suspension culture systern for Vero cells, which gives various technical 
10 and safety advantages over microcanior cultures. Suspension cultures can be grown to any 

desired fermentation scale without interfering with the closed system between cell passages, as 
no tiypsinization is required. 

To scale up the infection process in roller bottles to 30-50 liters per batch, the optimum 
M.Oi and harvesting periods for selected media and culture conditions shotild first be 
15 determined. For the larger scale, methods for harvesting and handling larger volumes of highly 

infectious material safely, should be used, and so cell separation via centrifugation should be 
replaced by a method such as filtration through single-use filfe cartridges. 

The MCS-chromatography and the gradient purification steps described above can readily 
be scaled to a batch volume of up to 50 liters. For larger volumes, however, and for increased 
20 purity, ultrafiltration and sterile filtration steps wiU be used. Nuclease treatment to remove host 
cell DNA will also be included. 

EXAMPLE 15: Large scale analytical methods 

Analytical methods for the SARS coronavitus include virus titration methods, 
immunological and physico-chemical methods to quantitate and characterize the purified antigen 
25 (ELISA, PAGE, western blots using specific antisera against purified whole virus, etc.). Other 
analytical tests include: fast yield testing via asymmetric field flow separation and laser particle 
detection and counting; Western blot using specific antisera against individual viral proteins; and 
tests for residual host cell DNA. 

Residual DNA testing is generally done by hybridization e.g. using a limi t test. Such 
30 testing is performed according to methods already established and validated for other cell lines. 
As an alternative, the Threshold™ method may be used. 

For producing specific antibodies, recombinant protein expression of aU the ORFs from the 
structural and non-structural gene regions of the SARS-CoV is used. The ORFs can be cloned 
and expressed in E.coli and, if necessary, also in eukaryotic vectors such as baculovirus. This can 
35 provide sufficient amounts of purified soluble protein to immunize mice and rabbits to produce 
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polyclonal and monoclonal antibodies against SARS proteins and to set up specific ELISA 
assays. Different expression vectors can be tested to maximize the yield of recombinant protein 
in a soluble form e.g. different vectors, one containing sequences coding for six N-terminal 
histidine residues and another containing a Glutathione-S-transferase protein fused to die 
5 C-tenninus of the SARS protdn. The recombinant proteins can be purified by single step column 
chromatogrqihy on either Nickel chelating Sepharose or Glutathione-Sepharose 4B resin. These 
procedures are very n^id and generally produce protein of 60-90% purity, which is suitable for 
raising specific antisera (Pizza et al. (2000) Science 287:1816-20). I^ve mice and two rabbits for 
each recombinant protein can be immunized SC with 20 and SO /rg recombinant protein, 

10 respectively, given in IFA as adjuvant, at day 0, 14 and 28. Sara are collected at day 7, 21 and 35 
to assess specific titers before euthanasia of the a nimal s for collection of blood and removal of 
spleens. 

For die detection of impurities (e.g. Vero cell derived proteins) in the vaccine preparation, 
rabbit serum reactive against Vaco-derived proteins can be used. Such antisera are obtained by 
15 immunizi ng rabbits with at least 10/tg of Vero cell lysate with CFA/IFA. The sera can be 

verified for reactivity against Vero-derived proteins in western blots. For more specific antisera 
against specific relevant cell-derived proteins- that tend to be co-purified with die virus, mock- 
infected ceU culture harvest that have undergone the purification process can be prepared and 
used for immunizing rabbits. 

20 Methods to determine neutralization titers of sera fiom immunized animals and humans 

•can be developed, without the constraints of using infectious SARS-CoV in a BSL-3+ 
laboratory. One such strategy will be to use recombinant antigens, particularly Spike protein or 
Spike-derived epitopes, and to develop ELISA assays for measuring antibodies against the target 
protein. Suitable epitopes allow a correlation to be established between the ELISA values and 
25 virus neutralization assay values. This approach provides a faster and more efficient (higher- 

throughput) comparison of specific and protective antibody titers. This ELISA test is also the 
ideal tool to monitor specific antibodies in safety trials, where several hrmdred animal sera must 
be tested. 

Another strategy is to combine stractural elements from both the pathogenic SARS-CoV 
30 and the non-pathogenic coronavirus mouse hq>atitis virus (MHV) to construct chimeric virus- 
like particles (VLPs) that can be labeled. The assay is based on fusion between octadecyl 
rhodamine (R18)-labeled VLPs and cells (Hoekstra et al. (1984) Biochemistry 23:5675-81). The 
method relies on the relief of fluorescence self-quenching of R18 incorporated into VLPs upon 
fusion with cellular membranes. Coronavirus VLPs have been shown to mimic native virions 
35 with respect to their appearance in the electron microscope (EM) and their biological activities. 
As they do not contain viral RNA, however, then they caimot cause a productive infection 
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(Vennema et oL (1996) EMBO J lS:2020-2028). The VLP system can be used for the mouse 
hepatitis virus (MHV) strain A59 (MHV-A59) )Godeke et oL (2000) J Virol 74:1566-15) 
containing a chimeric S piotdn. The protein chimera, consisting of the ectodomain of the SARS- 
CoV and the transmembrane and endodomain (64 C-teiminal amino acid residues) jBrom the 
5 MHV spike protein, can be co-expressed with the MHV M (membrane) and E (envelope) protein 

in OST-7 cells )Godeke et oL). VLPs secreted in the supernatant are harvested, purified and 
labeled with octadecyl riiodamine (R18) (Hoekstra et al). A constant amount of VLPs is 
incubated with a serial dilution of sera at 37“C for 1 hour in a 96-well plate. Subsequently, ceDs 
expressing the receptor for the SARS-CoV, the angiotensin-converting enzyme 2 (ACE2) (li et 
10 oL (2003) Nature 426:450-54) is be added and the extent of fusion can be measured with a 
fluorescence spectrophotometer. 

A final strategy to monitor the ability of sera to inhibit cell-cell fusion interactions between 
cells expressing the SARS-CoV S protein and a human cell line- expressing the angiotensin- 
converting enzyme 2 (ACE2), a functional receptor for SARS-CoV (li et al.). This reporter 
L5 gene-based assay uses the fluorescent shift (green to blue) of the fluorogenic substrate CCF2/AM 

(AM=acetoxymethyl) upon cleavage by P-lactamase (Bla) as read-out for cell-cell fusion 
(Zlokamik et oL (1998) Science 279:84-88). For this assay, a BHK-derived cell line, stably 
expressing Bla and the SARS-CoV S protein is generated. In addition, a hnniaTi cell line 
expressing ACE2 on its surface is used. BHK ceUs, expressing the S protein on their surface and 
M) Bla in their C 340 S 0 I are incubated with serial dilutions of the sera to be tested for Ih at 37°C. The 

cell line expressing the ACE2 is loacted with IpM CCF2/AM for 1 h at 22®C, washed twice with 
PBS, and co-cultivated with the BHK cells. In case of cell-cell fusion, Bla cleaves the substrate, 
resulting in a green blue shift with excitation at 409 nm. Inhibition of fusion by sera thus 
provides a detectable change. 

15 EXAMPLE 16: Stabilisation of inactivated SARS-CoV 

Although the purified inactivated SARS-CoV vaccine is capable of inducing potent 
neutralizing antibody responses in animals, it is relatively instable and can benefit from 
formulation to increase stability for an acceptable period of time. Suitable formulation changes 
include the use of various buffer systems, pH ranges, stabilizing excipients {e.g. sugars and 
iO sugar alcohols, amino acids, etc.) etc.. Stability testing can be conducted in real-time at normal 
storage temperatures, or can be conducted in an accelerated manner by using elevated 
temperatures. Vaccine stability can thus be increased to approximately one year or longer. 
Lyophilized vaccine formulation can also be used to extend shelf-life, possibly with further 
additives for stability during lyophilisation. 
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EXAMPLE 17: Dose and schedule optimization for inactivated virus 

Animal models of SARS-CoV infection have been reported, including mice, ferrets and 
macaques. As mentioned in example 4 above, mice immunized with the BPL-SARS-CoV 
vaccine achieve neutralizing antibody titers in the range of 1:100 - 1:1000, .similar to levels 
5 found in convalescent patients, and are 100% protected from infection with a challenge virus. 
While the mouse challenge model is limited only to infection but not disease, ferrets and 
macaques are useful models of the human SARS disease. Two to four days after inoculation with 
SARS-CoV, both ferrets and macaques have been found to shed infectious SARS-CoV particles 
from the throat, nose and pharynx, as demonstrated by RT-PCR and/or virus isolation on Vero 
10 cells. At approximately the same time, the infected animals became lethargic, show respiratory 
distress and eventually die. Histologically, SARS-CoV infection in these animals associates with 
pulmonary lesions of different severity, similar to those found in biopsied lung tissue and 
autopsy material firom S ARS patients. With the availability of these models, preclinical studies 
with vaccines can be performed initially in mice for immunogenicity readouts, while efficacy of 
IS optimal doses and schedules can be assessed in the ferret and macaque models. 

• Initial studies in mice are used to determine the optimal dose and schedule required to 
elicit the highest levels of neutralizing antibody, with titers at least in the range of 1/100 - 
1/1000. In parallel to the assessment of neutralizing activity, other features of the hunioral 
immune response and cellular immune responses can be investigated. In particular sera from 
JO immunized mice can be assessed for the isotype (IgGl vs. IgG2a) of the Spike-specific antibody 
response. Also, the frequencies of splenic CD4+ T cells producing lE^-y and IL-4 in response to • 
BPI>SARS-CoV particles wOl be assessed by ELISPOT and ELISA. Hiese exp^imehts can 
provide insight into the quality of the T ceU response helping the p rimin g of a protective 
antibody response. 

25 Increasing vaccine doses can be tested {e.g. from 5 to 20 /tg of BPL-SARS-CoV alone or 

muted with an equal volume of MFS9-citrate), administered SC to anesthetized mice in 100/tl of 
inoculum. Groups of B ALB/c mice, 10 per treatment, are immuniz ed, with priming at day 0 and 
boosting at days 14 and 28. Secondary endpoints compare the kinetics of neutralizing vs. Spike- 
specific antibody titras and assess the Thl/Ih2 profile of the specific immune response, and so 
30 neutralizing and Spike-specific antibody titers are assessed at days 7, 21, 35, and at 2, 3, 4, and 5 
months after priming. The IgG2a and IgGl titers of Spike-specific antibodies are determined at 
days 21, 35, and at 2, 3, 4, and 5 months after priming. Proliferation and IFN-y and IL-4 
production by splenic T cells against recombinant Spike protein firom SARS-CoV are assessed at 
day 42, and at the end of the 5th month. Peripheral blood is collected at days 7, 21, 35, and at 2, 
35 3, 4, and 5 months after priming. Spleen cells will be obtained at day 42 and at the end of the 5th 

month. Neutralizing and Spike-specific antibody titers and isotypes are determined by inhibition 
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of infection of Vero cells and by ELIS A, respectively. Proliferation of splenic cells is determined 
by ^(HJ-thymidine uptake. Frequencies of splenic IFN-y and IL^ producing GD4'*’ T 
lymphocytes is determined by ELISPOT and FACS analysis. 

Based on mouse results, the BPL-SARS-CoV vaccine can be tested in ferrets for the 
5 induction of protective neutralizmg antibody titers. Ferrets are immuniz ed according to a similar 

schedule as the mice and at the dose that elicits the highest neutralizmg antibody titers in mice at 
day 35 after the second boost Three groups of ferrets, 6 per treatment, are immunized with BPL- 
S ARS-CoV, alone or mixed with an equal volume of MF59-dtrate, administered SC to 
anesthetized animals in 200/d of inoculum. The animals are primed at day 0 and boosted at days 
10 14 and 28. Peripheral blood is collected at da^ 7, 21, and 35. Neutralizing and Spike-specific 

antibodies titers are determined by inhibition of SARS-CoV infection of Vcto cells and by 
ELISA, respectively. Each group of ferrets is used to assess efficacy of the BPL-SARS-CoV in 
protecting vaccinated animals fix>m infection and/or disease. Anesthetized animals are 
challenged intratracheally, two weeks after the last boost, with lO* median tissue culture 
15 infectious dose units (TCID 50 ) of the SARS-CoV CDC strain. Infection by SARS-CoV can be 

assessed by taking nasal, pharyngeal and rectal swabs from animals for 20 days after challenge > 
(Martina et aL supra). The presence of SARStCoV in sample materials can be assessed by RT- 
PCR and infection assay of Vero cells. Animals can be monitored for clinical signs of SARS 
disease by assessing sleeping time, temperature, respiratory symptoms, diarrhea, body weight 
iO and survival. Protection can be determined by the magnitude and duration of virus shedding, by 
• i duration and severity of disease symptoms, and by percentage of siuviving animals. The 

formulation eliciting the highest neutralizing antibody titers at day 35 can then be tested against a 
two-fold higher dose of BPL-SARS-CoV given in the same formulation in the same regimen. 

Additional studies can evaluate immunogenicity and efficacy of the candidate vaccine in 
^5 non-human prirnates. Three groups of adult cynomolgus macaques, 4 per treatment, are 
immunized with BPL-SARS-CoV, alone or mixed with an equal volume of MF59-citrate, 
administered SC to anesthetized animals in 500/tl of inoculum. The BPL-SARS-CoV vaccine 
can be tested at the dose eliciting the highest neutralizing antibody titers in ferrets at day 35 after 
the second boost* The animals are primed at day 0 and boosted at 3 and 6 weeks. Peripheral 
10 blood is collected at weeks 1, 4, and 7. A secondary endpoint is to assess the Thl/Th2 profile of 
. the specific immune response. Neutralizing and Spike-specific antibody titers and frequencies of 
peripheral blood CD4+ T cells producing IFN-y and lL-4 in response to die recombinant SARS- 
CoV Spike protein is thus assessed at weeks 1, 4, and 7. Neutralizmg and Spike-specific 
antibody titers can be determined by inhibition of SARS-CoV infection of Vero cells and by 
15 EUSA, respectively. Intracellular cytokine staining and FACS analysis will be used to quantify 
IFN-y- and IL-4-producing Cl^f^ T cells. The macaques can also be used to assess efficacy of 
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the BPL-SARS-CoV in protecting vaccinated anknals from infection and/or disease. ' 
Anesthetized macaques can be challenged two weeks after the last boost with 10^ median tissue 
culture infectious dose unit (TCID 50 ) of the SARS-CoV C!DC strain in a 5 ml volume. A few 
drops of the virus can also be administered on each of the conjunctiva, 0.5 ml in the nose and the 
5 remainder in the trachea. Infection by S ARS-CoV can be assessed by taking nasal, pharyngeal, 

and rectal swabs, and feces fiem animals for 20 days after challenge (Fouchier et al. (20030 
Nature 423:240). Hie presence (rf SARS-CoV in sample materials can be assessed by RT-PCR 
and infection assay of Vero cells. Animals can also be monitored for clinical signs of S ARS 
disease by assessing sleeping time, temperature, respiratory symptoms, diarrhea, body weight 
10 and survival. Protection can be determined by the magnitude and duration of virus shedding, by 
diuration and severity of disease symptoms, and by pracentage of surviving animal s. 



Mice 



Group 


Treatment 


Dose/Route 


Sampling interyal 


Number of mice 


1-3 


BPL-SARS-CoV 


20 , 10, 5 pg/SC 


7 , 21. 35 d; 

2, 3, 4, 5 m; 


10 per dose level 


4-6 


BPL-SARS-CoV 


20, 10. 5 Mfi/SC 


42 d 


10 per dose level 


7-9 


BPL-SARS-CoV MF59 


20, 10. 5 /tg/SC 


7, 21. 35 d; 
2, 3, 4, 5 m; 


10 per dose level 


10-12 


BPL-SARS-CoV MF59 


20, 10. 5 Mg/sc 


42 d 


10 per dose level 


.13 


MF59 


NA/SC 


7, 21, 35 d; 
2, 3, 4, 5 m; 


10 + 10 (sacrificed at 
42 d and end 5 m) 


14 


Saline 


NA/SC 


7,21, 35 d; 
2, 3, 4, 5 m; 


10 + 10 (sacrificed at 
42 d and end 5 m) 



Ferrets 



Group 


Treatment 


Route 


Sampling interval 


No. of ferrets 


1 


BPL-SARS-CoV 


sc 


7 , 21, 35 d 


6 


2 


BPLSARS-C 0 V-MF 59 


sc 


7, 21, 35 d 


6 


3 


Saline 


sc 


7,21, 35 d 


6 



Macaques 



Group 


Treatment 


Route 


Samplins interval 


No. of macaques 


1 


BPLSARS-CoV 


SC 


1,4, 7 w 


4 


2 


BPLSARS-CoV- 

MF59 


SC 


1,4, 7 w 


4 


3 


Saline 


SC 


1.4. 7 w 


4 



EXAMPLE 18: Human T ceU responses 

As a prelude to initiation of clinical studies in humans, the reactivity of peripheral blood T 
20 lymphocytes from healthy donors with different HLA haplotypes can be assessed using the in 
vitro priming technique (Abrignani et al. (1990) Proc Natl Acad SciUSA 87:6136-40). The aim 
of this study is to have a first indication of the immune-dominant T cell epitopes in SARS-CoV 



- 308 - 






wo 2004/092360 



PCT/US2004/011710 



proteins. Briefly PBMCs from 20 healthy donors with dijEfeient HLA haplotypes will be cultured 
in medium con taining 5% autologous serum, in the presence of different concentration of SARS- 
BPL-CoV particles in the range from 0.5 to 20 /tg/ml. The expression of activation markers will 
be assessed after 24 and 48 hours. Frequencies of IFN-y- and IL4- producing T lymphocytes 
5 win be assessed after 12h and after 15 days in culture, in the presence of 100 U/ml recombinant 
human IL-2. Activated and cytokines producing CD4 T lymphocytes will be sorted and 
eventually cloned as single cells using FACS technologies. The CD4+ T ceU repertoire from 
human subjects with different HLA will be assessed by proliferation assays of the CD4+ T cell 
lines and clones against autologous EBV-transfoimed cell lines loaded with 15-mer overlapping 
10 peptides finm the most relevant structural and non structural protein of the SARS-CoV. 

When moving to actual human trials, safety and immune responses will be evaluated in 
healthy adults foUoAving intramuscular immunization with escalating doses of the BPL^-' 
inactivated SARS-CoV vaccine, with MF59 adjuvant being included or omitted depending on 
preclinical data. Tliree/four immunizations will be given at 0, 1, 6 months in the first cohort, and 
15 at 0, 1, 2, 6 mouths and 0, 2, 6 weeks in die second and third cohorts respectively. The trial will . 
be observer blind and placebo controlled. Subjects wfil be randomized into each dose level, 
foomime response parameters to be measured will include serum neutralizing antibodies, ELISA 
antibodies and peripheral blood IFN-gamma-producing CD4+ T cells by intracellular cytokine 
staiiiing. 



Group 


Antigen 
dose (ng) 


Administration 

schedule 


No. treated 
subjects 


No. subjects 
with placebo 


Sampling interval 

s 


Al 


10 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 mos 


A2 


20 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 mos 


B1 


10 


0,1,2,6 months 


18 


12 


0, 1, 2, 6, 7 mos 


B2 


20 


0,1,2,6 months 


18 


12 


0, 1, 2, 6, 7 mos 


Cl 


10 


0,2,6 weeks 


18 


12 


0, 2, 6, 10, 30 wks 


C2 


20 


0 A6 weeks 


18 


12 


0, 2, 6, 10, 30 wks 



» 

EXAMPLE 19: Selection ofCHO cell lines for Spike protein ejq/ression 

Methods for the derivation of Chinese Hamster Ovary (CHO) cell lines that stably express 
viral envelope glycoproteins that are conformationally intact, appropriately glycosylated and 
efficiently bind neutralizing antibodies are well established for HIV and HCV (Srivastava et oL 
15 (2002) J Virol 76:2835-47; Srivastava et al. (2003) J Virol 77:11244-259; Heile et al. (2000) J 

Virol 74:6885-92). The same techniques can be applied to SARS-CoV, to generate two different 
stable CHOK-1 cell lines producing either full-length or truncated SARS Spike proteins. The 
Spike proteins can be expressed using the constructs described herein, but without the bexa-His 
tags. These proteins can compared for their ability to produce neutralizing antibodies in 
10 immunized animals as well as for their expression levels in CHOK-1 cells. 
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A pCMV3 vector expressing Spike can be used for the derivation of stable CHOK-1 cell 
lines, containing the CMV enhancet/promotrar, ampicillin resistance, and a fused DHFR and 
attenuated neomycin gene for selection purposes. Stable cell lines can produced using the 
neomycin selection system in CHOK-1 cells. Qones can be sequenced to verify the integrity of 
5 the insert, and transirat transfections can be performed using Trans-LTl polyamine transfection 
reagent (PanVera Corp., Madison, WI) to assess the expression level and also the integrity of the 
expressed protein by ELISA and western blot analysis. 

Initial CHO cells will be selected to be free from TSE/BSE contaminants and risks 
according to relevant regulatory standards. To constmct cell lines, procedures involve 
10 transfection, primary screening with selective medium, followed by subcloning to assure purity 
of ceU lines. CeU supernatants can be assayed using an antigen capture ELISA to quantify 
expression levels at aU stages of selection and amplification. For full-length Spike expression, 
methanol fixed cells can be screened for internal expression by immunofluorescent staining 
using a rabbit anti-SARS antibody. Successive measurements at the T75-flask stage of 
15 expansion canbe employed to assure stability of expression levels. The molecular mass and 

integrity of the expressed proteins can be checked by PAGE both under native and reducing and 
denaturing conditions, followed by immunoprobing. 

The pCMV3 vectors expressing SARS-CoV Spike proteins in either full-length or 
truncated forms can be introduced into CHOK-1 cells using the Trans-LT-1 reagent and non- 
20 selective media. 24-48 hours post-transfection, depending on cell density, cells are split at a 1:5 . 
ratio and the medium can be changed to selective media containing neomycin at 500/ig/ml. Any . 
bovine serum used in these procedures will be from TSE-free sources that meet regulatory 
standards. Teri to fourteen days later,-individual colonies can be picked and transferred to 96 
well plates and cultured in complete nrai-selective medium. When approximately 80% of the 
25 weUs are confluent, 24 hour supernatants can be screened by Spike capture EUSA. For initial 
expression of full length Spike protein, cells can be fixed witii methanol and screened by 
immunofluorescent staining using a rabbit anti-SARS antibody. After low-expressing cell lines 
have been eliminated and there are fewer than 20-30 cell lines, capture ELISA and western blots 
can then be used to determine the expression level after ceU lysis. A portion of each ceU Une can 
30 be pelleted, weighed and lysed in 1% Triton lysis buffer for determination of expression levels. 
Three to four clones producing the highest levels of spike protein in correct stracture and 
conformation can be expanded to three-liter bioreactors and adapted to low serum suspension 
culture conditions for scale-up. 

The antigen capture ELISA assay for die SARS spike protein can be performed using 96 
35 weU flat-bottom plates coated with 250ng per weU of puriUed immunoglobulin obtained from 
rabbit sera that were immuniz ed with inactivated SARS virus. Supernatant or lysate samp les are 
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added and incubated for 2 hours at 37°C. Bound antigen is reacted against pooled SARS'*''* 
serum or high affinity monoclonal antibo(fy either human or mouse against SARS spike protein 
and detected using ^ipropiiate species-specific peroxidase-conjugated second antibody, llie 
plates are developed using TMB substrate (Pierce, Rockford, IL), read at a wavelength of 
5 450nm, and the concentration of protein per ml sample is derived jfiom a standard curve (OD vs. 

protein concentration) based on serial dilutions of a known concentration of recombinant spike 
protein. 

Tire immunoprobing analysis Avill also be performed following the standard methods 
descnbed by Srivastava et aL (2002) supra. Briefly, 10-20/il of the sample is analyzed on 4-20% 
10 SDS PAGE under non-reducing/denaturing conditions with mild heating. The proteins are then 
transferred onto nitrocellulose membranes and reacted against polyclonal anti-Spike rabbit 
sCTum, followed by anti-rabbit Ig conjugated to Alexa 688 (Molecular Probes, Oregon). The 
blots are scaimed using an infrared imaging system. 

The highest expressing candidate ceU lines will be screened for Spike protein expression 
15 and stability in small-scale (3 liter) perfusion bioreactors. The candidate clones will be fiirther . 
evaluated for level of expression as well as integrity of expressed proteun, and subsequently 
tested for expression stability in the absence of selection. The selected clones also will be tested 
for maintenance of the DNA sequence integrity of the integrated SARS spike protein gene. To 
quickly monitor the expression levels in small flasks and in the three liter evaluation cultures, a 
20 lectin-based process (Gluvanthus Nivalis lectin) has been developed to isolate SARS spike 

pnoteih to a degree of purity that allows semi-quantitation and characterization of the protein in . 
CHO supernatant. FUll-length Spike protein will be obtained fix>m Triton X-100 detergent 
extracted ceUs and thra captured on GNA lectin, followed by hydroxyapatite and SP ' 
chromatograph. Eluted protein is then characterized by: (1) polyacrylamide gel electrophoresis 
25 (PAGE) and Coomassie staining, (2) immunoprobing with anti-SARS rabbit sera, (3) structural 
characterization using size exclusion chromatography (SEC), as well as mass spec analysis using 
MALDI-TOF. 

Productivity from the CHO cell line expressing SARS spike protein should be at least 2 
mg/L and for fuU-length Spike protein wiU be 3mg/100gm of cells, at steady-state cell density. 

30 Yield from one 45 day, 2.5-liter bioreactor wiU be -1000 mg crude protein. 

EXAMPLE 20: Purification of spike protein for human vaccines 

To puriify SARS spike protein for the purpose of producing GMP grade mafftri al for human 
use, the following basic process is used, with all steps being performed at 2-8°C: die starting 
material, concentrated CHO cell culture supernatant (20-30X) is thawed and filtered through a 
35 0.45/im membrane; this material is heavily contaminated proteins from culture, as well as DNA; 
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the first purification step is affinity chromatography using Gluvanthus Nivalis (GNA), a lectin 
that preferentially recognizes terminal mann ose containing carbohydrates; glycosylated proteins, 
including SARS spike protein are crq)tured and non-glycosylated proteins, as well as DNA, do 
not bind to this column; the GNA column is followed by two chromatographic steps operated in 
S the flow throng mode; the anion exchan^r, DEAE, and coamic hydroxyapatite (cHAP); 
DEAE binds some cont aminatin g supernatant proteins and DNA, whereas cHAP binds any 
contaminating serum proteins; full-length Spike protein is purified from the cell pellet; the cells 
ate lysed with Triton X-100 and full-length Spike protein is then captured on GNA lectin, 
followed by hydroxyapatite and SP chromatography. 

10 Tlie purified SARS spike can be further treated to remove adventitious viruses: viral 

inactivation at pH 3.S for 1 hour; the sample is then concentrated and diafiltered into a buffer at 
pH 4 and finally captured the purified protein using SP resin; the spike protein binds to -this resin 
and many viruses flow through. 

The spike protein is eluted, concentrated and diafiltered into formulation buffer. This 
15 formulated bulk product is then filtered through a DV50 viral removal membrane followed by 

filtration through a 0.2 (im membrane. The formulated bulk u filled into suitable containers e.g. 
into 3.0 ml vials, in a class 100 laminar flow hood. 

In process testing at each step of the purification includes protein concentration, endotoxin 
(LAL), bioburden, and recovery. 

20 Prior to human administration, a test for potency will evaluate the specific ability of the 

vaccine in an m vitro or in vivo test to effect a given response. The in vivo immunogenicity will 
be determined by dosing groups of 10 mice with various doses of the protein antigen. Sera will 
be analyzed for die presence of IgG antibodies using an ELISA. The criterion for passing will be 
based upon the number of vaccine treated animal s that are seropositive compared to a reference 
25 standard. Other tests include General Safety, sterility, purity, identity of the vaccine (using an 
ELISA specific for Spike protein), and quantity & protein concentration (UV spectrophotometric 
absorbance procedure based on the molar absorbance of the aromatic amin o acids). 

Stability testing will be performed on the bulk drug substance and on the final container 
product. Bulk product will be evaluated at temperatures of -60°C (recommended storage 
30 condition), 25 ±2 °C and 40 ±2 °C protected from light, at time points of 0, 3, 6, 9, 12 mondis. 
Final container product will be tested at temperatures of -60 °C, and inverted at 5 ±3 °C, 25 ±2 
®C, and 40 ±2 ®C at time points of 0, 3, 6, 9, 12 months. Stability-indicating assays may include 
appearance, pH, protein content, SDS-PAGE, size exclusion HPLC, and container/closure 
integrity, performed on single samples of bulk and triplicate vials of final container mat^al. 

35 The protein purified in this way can be evaluated in mice, rabbits and ferrets as described 

in, and based on the results of, examples 4, 5, 8 and 9 above. 
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Initial experiments will be performed in mice to determine optimal dose and schedule of 
the GMP Spike protein required to elicit the highest levels of neutralizing antibody, witii titers at 
least in the range of 1/100 - 1/1000. Spike protein will be tested in the range from 5 to 40 pg, 
alone or mixed with an equal volume of MF59-citrate, to anesthetized mice in 100/il of 
S inoculum. Groups of BALB/c mice, 10 per treatment, wUl be immunize d. The animals will be 
primed at day 0 and boosted at days 14 and 28. Secondary endpoints will be to compare the 
kinetics of neutralizing vs. Spike-specific antibody titers and to assess the Thl/Th2 profile of the 
specific immu ne response. Neutralizing and Spike-spedfic antibody titers will be assessed at 
days 7, 21, and 35 and at 2, 3, 4, and 5 months after priming; the IgG2a and IgGl titers of Spike- 
10 specific antibodies will be determined at days 21 and 35, and at 2, 3, 4, and 5'montbs after 

priming; proliferation and IFN-r and ILr4 production by splenic T ceU against the recombinant 
Spike protdn fixjm SARS-CoV will be assessed at day 42 and at the end of the 5th month. 
Peripheral blood will be collected at days 7, 21, and 35 and at 2, 3, 4, and 5 months after 
priming; spleen cells at day 42 and at tiie end of the 5th month. Neutralizing and Spike-specific 
15 antibody titers and isotypes will be determined by inhibition of SARS-CoV infection of Veto 
cells and by ELISA, respectively. Proliferation of splenic cells will be determined by ^[H]- 
thymidine uptake. Jrequencies of splenic IFNry and IL-4 producing CD4+ T lymphocytes, will 
be determined by ELISPOT and FACS analysis. 

Next, the optimal dosing and schedule for recombinant Spike vaccine will be determined in 
20 ferrets. Based on the mouse results, the Spike vaccine eliciting the highest antibody neutralizing 
titers will be tested against a two-fold higher dose of recombinant Spike protein given in the 
same formulation. Three groups of ferrets, 6 p^ treatment, will be immunize d SC under 
anesthesia with 200pl of inoculum. Hie animals will be primed at day 0 a[nd boosted at days 14 ' 
and 28. Peripheral blood will be collected at days 7, 21, and 35. Neutralizing and Spike-specific 
25 antibodies titers will be determined by inhibition of SARS-CoV infection of Vero cells and by 

ELISA, respectively. Similar to the previous ferret studies, each group of animals will be used to 
assess efficacy of the vaccine in protecting immunized animals fijom infection and/or disease. 

hnmunogenicity and efficacy of the candidate vaccine also will be evaluated in nonhuman 
primates. Three groups of adult cynomolgus macaques, 4 per treatment, will be immunized with 
30 recombinant SARS-CoV Spike protein, alone or mixed with an equal volume of MF59-citrate, 

administered SC to anesthetized animals in 500 /xl of inoculum. The Spike protein vaccine will 
be tested at the dose eliciting the highest neutralizing antibody titers in ferrets at day 35. The 
animal s will be primed at day 0 and boosted at 3 and 6 weeks. Peripheral blood will be collected 
at weeks 1, 4, and 7. A secondary endpoint wiU be to assess the Thl/Th2 profiile of the specific 
35 immune response, as described above (neutralizing and Spike-specific antibody titers. 
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frequencies of peripheral blood CD4+ T cells producing IFN-^ and EL-4 in response to the 
recombinant Spike protein, assessed at at weeks 1, 4, and 7). 

Rnally, hiiinan phase I, placebo-controUed, dose-escalation, safety/ immunogenicity trials 
will be poformed for the IM recombinant SARS vaccine with MF59 adjuvant Hie trial will 
5 evaluate safety and immune responses in healthy adults following immunization with escalating 

doses of SARS recombinant vaccine with MFS9 adjuvant, administered intramuscularly. 
Three/four immunizations will be given at 0, 1, 6 months. The trial wiU be observer blind and 
placebo controlled. Subjects will be randomized into each dose level. Immune response 
parameters to be measured include serum neutralizing antibodies, ELISA antibodies and 
10 peripheral blood lEN-Tr-producing CD4+ T cells by intracellular cytokine staining: 



Group 


Vaccine 
Antigen 
dose (ms) 


Admimstratioii 

schedule 


No. of 

treated 

subjects 


No. of subjects 
with placebo 
(MF59) 


Sampliii^ interval 


Al 


50 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 months 


A2 


100 


04»6 months 


18 


6 


0, 1,2, 6, 7 months 



EXAMPLE 21: Comparison of inactivated virus and purified Spike protein 

Immunogenicity and efficacy of the inactivated virus vaccine and the purified Spike 
protein can be compared in non-human primates. Three groups of adult cynomolgus macaques, 4 
15 for treatment, will be immu nized with recombinant SARS-CoV Spike protein from CHO cell 
lines or with BPL-SARS-COV, given in the dose and formulation eliciting the highest 
neutralizing antibody titers in previous immunogenicity challenge experiments, administered SC 
to anesthetized animals in 500/il of inoculum. The animals will be primed at day 0 and boosted 
at 3 and 6 weeks. Peripheral blood will be collected at weeks 1, 4, 7. A secondary em^oint will 
20 be to assess the Thl/Hi2 profile of the specific immune response, as described above. 



Group 


Treatment 


I>ose/Route 


Sampling interval 


No. of macaques 


1 • 


Rec-Spike protein 
+ or - MF59 


Y/tg/SC 


1.4, 7 w 


4 


2 


BPL-SARS-CoV 
+ or - MF59 


Y Mg/sc 


1,4, 7 w 


4 


3 


Saline 


NA/SC 


1.4, 7 w 


4 



EXAMPLE 22: Expression in yeast 

Yeast is a useful and inexpensive eukaryotic expulsion system. Yeast-expressed proteins 
are used in recombinant hepatitis B virus vaccines, and recombinant SARS antigens may also be 
25 expressed in yeast for vaccine purposes. Yeast-expression is also convenient for the production 

of antigens for preparing monoclonal and polyclonal antitobodies, or for use in sm)logical 
assays. 
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The nucleocapsid protein (N) and two different vearsions of the spike glycoprotein (S) from 
SARS coronavirus FRA strain (A Y3 10120) were cloned for expression in S.cerevisiae: 

SARS N: aa 1 - 422 (coordinates 28120-29388 of AY3 10120 strain) - Hg.65 
SARS spike: aa 14 - 1195 (transmembrane domain and cytoplasmic tail deleted) -Hg.66 
5 SARS spike: aa 14 - 662 (SI domain) 

To make the SI construct, a Xhol-Notl fragment of approximately 3733bp encoding the full- 
length spike glycoprotein was the starting point PCR was used to amplify the full-length gene 
in two pieces: JQ>aI-BlnI of 2440bp and Blnl-Sall of 1306bp. These fragments were subcloned 
into commercial vectors (Novagen): pT7Blue2 Xbal-Blnl (5' end of spike glycoprotein) and 
10 pT7Blue2 Blnl-Sall (3' end of spike glycoprotein; Rgure 58), respectively. Hie foDowing 
primers were used in the subsequent PCR reactions: Spk-1 (5') SEQ ID NO: 9785; Spk-2 (5*) 
SBQ ID NO: 9786; Spk-3 (5*) SEQ ID NO: 9787; Spk;4 (50 SEQ ID NO: 9788. 

. E. coli HBlOl cornpetent cells were transformed with the PCR ligation product and plated ■' 
on Luria agar plates, containing 100/ig/rnl ampicillin. The desired clones were identified using • 
15 miniscreen DNA analysis. Aft^ sequence verification and plasmid amplification of the desired ' 
subclones, it was desirable to eliminate the internal SaU site present in the Xbal-Blnl portion of . 
the spike sequence in order to facilitate future cloning into the yeast expression vector (BaniHI- . 
Sail). Therefore, we prepared a CeUI-Mfel vector fiom the pT7Blue2 Xbal-Blnl (5’ end Spike) 
subclone to eliminated a 143bp sequence containing the SaU site. Kinased oligos DSl-6 (SEQ 
20 ID NOS: 9789-9794) were then ligated into the CelH-Mfel vector to replace the 143bp that were • 
removed to mutate die Sail site (no aa changes), creating pT7Blue2.XbaI-BlnIAsal. 

The 5' Xbal-Blnl (from pT7Blue2 Jfbal-Blnl ASal) and the 3' Blnl-Sall (from pT7Blue2 
Blni-SaU) spike glycoprotein inserts woe gel-purified and ligated them into the p893-l Xbal- 
SaU vector (a vector derived from plitmus 38 (New England Biolabs) with the alpha-factor 
25 leader sequence cloned into the BamHI -SaU sites of the MCS). The resulting full-length SARS 

Spike coding sequence was named p893-l.SARS Spike 1255 #9 (Figure 58). 

RcoK HBlOl competent cells were transformed with the oligo replacement ligation 
product and plated on Luria agar plates, containing 100/tg/ml ampicilhn. The desired clones 
were identified using miniscreen DNA analysis. After sequence verification of the positive 
30 clones, pT7Blue2 Xba-Bln ASal was chosen for use as a template for PCR reactions to amplify 
the Spike SI 1967 bp Xba-Sal fragment The fragment was then subcloned into the p893-l Xba- 
Sal vector, sequence verified, and named it p893-l.Spike SI #11 (Figure 59). 

hi order to clone into the S.cerevisiae expression vector, pBS24.1, the 5' raid of the SI 
sequence had to be modified from Xbal to lUndUI to aUow ligation with the 3' FlindTTT end of the 
35 ADH2/GAPDH BamHI-IUndin promoter fragment From pT7Blue2 Xba-BlnASal (described 

above) an Agel-SaU 1943bp fragment was gel-purified. This fragment was ligated along with a 
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synthetic pair of HindlTT-Affll 30bp kinased oligos (Sl-l+Sl-2 creating ttie necessary 5' HindHI 
site) into the pSP72 Hindni-Sall conunercial subcloning vector (named pSP72.SARS Spike SI 
#2; Kgure 59). Sl-1 had SEQ ID NO: 9795 and Sl-2 has SEQ ID NO: 9796. 

After sequence verification of the positive clone fiom miniscreen DNA analysis, the 
5 Hindin-SaU fragment was gel purified. Hie 1365 bp B airiH T- HindTTT ADH2/GAPDH promoter 
fragrhent was ligated along with the 1973 bp HindTTT-Sall SI fragment into the pBS24.1 B amH T- 
Sall vector creating the genetically engineered ptLSARS Spike SI #2 expression plasmid (Figure 
60). 

S.cerevisiae strain AD3 was transformed with pd.SARS Spike SI #2 and single 
10 transformants were checked for expression after depletion of glucose in the medium. Hie 

recombinant protein was expressed at high levels in yeast, as detected by Coomassie blue 
staining, hi particular, yeast cells were transformed with the SAKS SI expression plasmid using 
the Invitrogen S .c. EasyCorhp™ Transformation Kit. Expression in shown in Figure 57v 

• To express Spike 1195 protein, which does not contain the trans-membrane (TM) region or 
15 cytoplasmic tail that are present in the fuU-lenglh SARS construct, the following series of genetic 
manipulations was pmformed: 

’ From pT7Blue2 Blnl-SaUL #1 1 (described above) a Blnl-Dral 1056bp fragment was gel 
purified. Hiis fragment was ligated with a synthetic pair of 68bp Dral-Sall kinased oligos 
(DRSl+2; SEQ ID NOS: 9797 & 9798) into a pT7Blue2 Blnl-Sall vector (Figure 61). E.coli 
20 HB 101 competent cells were transformed with the oligo replacement ligation product and plated - 
on Luria agar plates, containing 100/ig/ml ampicillin. The desired clones were identified using 
miniscreen DNA analysis. After sequence confirmation the clone was named pT7Blue2 Blnl- 
Sal Spike 1 195 #7. The 1 126bp Blnl-SaU fragment encoding the 3’ end of the Spike 1195 was 
gel purified (Fig.61). 

25 Li order to generate the Xbal-Sall Spike 1195 fragment, the 3109bp XbM-Pcil firagment 

was isolated from the p893-l.SARS Spike 1255 #9 (described above) and a 457bp Pci[-Sall 
fragment from pT7Blue2.SARS Spike 1195 #7 (described above). The two fiagments were 
cloned into the p893-l Xbal-Sall vector, creating the p893-l.SARS Spike 1 195 #34 plasmid 
(Rguie 62). 

10 To clone SARS Spike 1195 into the pBS24.1 Saccharomyces cerevisisae expression 

vector, it was necessary to modify the 5' end of the SARS Spike 1 195 from Xbal to HinHTTT, as 
done for the Spike SI expression clone described above. To begin, the 2416bp Agel-Blnl 
firagment was isolated from p893-l.SARS Spike 1195 #34. Hiis firagment was ligated with the 
synthetic Hindlll-Agel 30bp oligos (described above to generate the SI protein for expression in 
35 S.cerevisiae) into the pT7Blue2 Hindlll-Blnl vector. E. coli HBlOl competent cells were 
transformed with the oligo replacement ligation product and plated on Luria agar plates, 
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containing 100/ig/ml ampicillin. The desired clones were identified using miniscreen DNA 
analysis. After sequence verification of the positive clone and plasmid amplification of 
pT7Blue2.SARS 1195 5’ Hindin-Blnl#10 (Rgure 63), we isolated a 402bp lEndHI-NcoI 
fragment and the 2044bp Ncol-Blnl fragment (Hgure 63). It was necessary for llie ffindDI-Blnl 
5 isolation to be done in two steps to avoid cloning issues related to die internal TfindTTT site 
located at nucleotide number 1319 of the spike 1195 protein. 

To assemble the BamHI-SalE-expression cassette of Spike 1195 into the pBS24.1 vector 
E.coli HBlOl competent cells were transformed with the the BamHl- HindTTT (ADH2/GAPDH 
promoter), Hindin-Ncol 402bp fragment, Ncol-Blnl 2044bp and the Blnl-Sall 1126bp fragments 
10 into the pBS24.1 Bamlll-Sall vector. Ibe samples were plated on Luria agar plates, containing 
100/tg/ml ampicillin. The desired clone was identified using miniscreen DNA analysis, thus 
creating the genetically engineered pd.SARS Spike 1195 #10 (Figure 64). 

! S.cerevisiae strain AD3 was transformed with pd.SARS Spike 1195 #10 and single 
transformants were checked for expression after depletion of glucose in the medium. The 
15 recombinant protein was detected by Coomassie blue staining. In particular, yeast cells were 
transformed with the SARS 1195 expression plasmid using the Invitrogen S.c. EasyCompT*^ 
Transformation Kit. 

EXAMPLE 23: Expression in mammalian ceU lines 

cDNA fragments containing the S protein ORF of 1255 amin o acids were am plifi ed by 
20 RT-PCR from SARS viral RNA (Frankfurt isolate) grown in Veio cells. The amplified PCR 
fragments were cloned into pBlueScript vector, sequenced, and consensus spike sequence was 
assembled to create a full-length SARS spike clone, pBSnSh. In vitro transcription of pBSnSh 
followed by translation in a rabbit reticulocyte lysate resulted in the production of single 
polypeptide with an estimated molecular mass of ~140 kDa. 

25 The insert of this plasmid was recloned via Xhol and Not I into a mammlian expression 

vector pCMVin (Srivastava et oL (2003) J. Virol 77:11244-11259) to create a construct, nSh 
(Fig. 74A). A PCR fragment containing a spike protein of 1195 amino acid, which was deleted 
for transmembrane (TM) domain and cystein-rich cytoplasmic tail (Cy) was amplified and 
cloned pCMVin vector to generate the contstruct nShATC (Figure.74B). Both constructs woe 
30 tagged with six histidine residues at the C-terminus in order to aid in their characterization. The 

Xho I/Not I fragment without a histidine tag also was subcloned into the alphavirus replicon 
vector backbone pVCRchim2.1 for use in the production of an alphavirus replicon particle 
chimera that expresses S protein. Production and characterization of the replication defective 
alphavirus vector particles was performed essentially as described previously (Perri et cd, (2003) 
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/. ViroL 77:10394-10403; Polo et aL (1999) PNAS USA. 96:4598-4603). The resultant alphaviius 
vector particles were named as VEE/SIN. 

CX)S7 cells and BHK-21 cells were maintained in Dulbecco’s modified Eagle’s medium 
supplemented witfi 10% fetal bovine serum at 37“C and 5% CO 2 in air. COS7 cells were 
5 transfected with expression plasmids (nSh, nShATC) using a transfection kit (TransIt-COS, 
Minis) following the manufacturer’s protocol. The cells were washed once witii ice-cold PBS 
and lysed with lx Lysis buffo- (20mM MOPS, lOmM NaC3, 1.5mM MgCl 2 , and 1% Triton X- 
100) containing complete mini protease inhibitor (Roche). After a 30-min incubation on ice, the 
debris was cleared by centrifugation. The cleared lysate was either purified or used directly in 
10 western blottmg. 

To purify secreted spike proteins, medium fi^om transfected cells was collected and 
subjected to cCTtrifiigation at 12,000 rpm for 10 min to remove cellular debris. The cleared 
medium was applied to a ConA-agarose column (Vector Lab). The column was washed 
extensively with 20mM sodium phosphate buffer, and then the boimd proteins were eluted with 
15 IM methyl a-D-mannopyranoside (MMP), IM NaQ in 20mM sodium phosphate buffer; 

Column fractions containing SARS-CoV spike proteins were applied to MagneHis Protein 
purification system (E’romega) following the protocol suggested by die manufacturer. 

For western blot analysis, proteins were separated by 4-20% SDS-PAGE and then 
transferred electrophoretically to nitrocellulose membrane (Invitrogen). Membrane was blocked 
10 in blocking buffer (5% skim milk and 0. 1% Tween 20 in PBS) and incubated with indicated 
antibody at room temperature for 1 hr, washed and probed with horseradish peroxidase (HRP)- 
conjugated secondary antibody (Biosource) followed by chemiluminescence (ECL system, 
Amersham) and exposed by X-ray films. The antibodies used were a mouse monoclonal anti- 
histidine antibody (anti-His»tag Mab, Novagoi), a rabbit polyclonlal antipeptide antibody against 
25 SARS-CoV spike proten (SmPab, Abgent), or rabbit anti-SARS sera (2BE) obtained by 
immunization of rabbits with purified SARS-CoV virion. The latter has a cell culture 
neutralizing titer of 1/2,500. Unless stated otherwise, antibody was used at 1/1,000 for anti- 
histidine antibody and SmPab and 1/10,000 for anti-SARS rabbit sera. 

Some spike proteins were treated with Peptide-N glycosidase F (PNGase F). Cell lysates 
10 were diluted in 0.5% SDS and 1% P-mercaptoethanol and denatured at 100°C for 10 min. After 
2-fold dilution with 1% NP-40 in 50mM sodium phosphate (pH 7.5), the samples were treated 
with PNGase F (NEB) at 37°C for 1 hr. Enzyme-treated samples were analyzed by 4-12% SDS- 
PAGE in reducing condition. For a partial digestion with the PNGase, the ceD lysates were 
diluted with 50mM sodium phosphate (pH 6.0) containing 0.75% Triton-X and treated with 
35 PNGase F (Calbiochem) at 37®C for 3 hr. Enzyme-treated samples were analyzed by 4-20% 
SDS-PAGE in nonreducing condition. 
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Western blots of cells 48^bouis after transfection are shown in Rgure 75. Tte S protein 
was detected in cell lysates as a doublet with estimated molecular weight of ~170 ~180 kDa , 
when the lysate was boiled and analysed under reducing SDS-PAGE conditions (Rg. 75A, lane 
3). This doublet appears to result from differential glycosylation of one polypeptide product 
5 since pre-treatment of the cell lysate with PNGase F reduced the doublet to a single species of 
-140 kDa (Rg. 75A, lane 4). This is the expected size predicted from the aa sequence for a full- 
length, intact polypeptide product This experiment indicates that the full length SARS-CoV S is 
expressed in m a mmali an cells as a single, uncleaved polypeptide, but in two differentially 
glycosylated forms, gpl70 and gpl80 respectively. Unlike the two S glycofonns encoded by the 
10 full-length sequence, none of which were secreted, die SA protein product was detected both in 
cell lysates (Rg. 75A, lane 5) as well as in the cell culture medium (Rg. 75B, laneS) as a single 
species of - 160 kDa. 

In order to further characterize the intraceUular processing of the S protein, and as 
described above, 'BHK21 cells were infected with defective alphavirus particles expressing the 
15 full-lraigth S. At 6 hr post infection with a MOI of 5, infected cells were pulse labeled for 1 hr 

with U[“S] methionine/cysteine and chased for 2 or 4 hours. The [*^S]-labeled S protein was 

immuno-precipitated using the rabbit antiserum raised a^nst inactivated, purified virus and 
then digested with Endo R The Endo H treatment involved, dilution with a sample buffer (50mM 
sodium phosphate, 0.1% SDS, 50 mM DTT, pH 6.0) and boiling for 5 min. After denaturation, 

10 the samples were further diluted with 0.75% Triton-X 100 and treated with endoglycosidaseH 

(Endo H) following manufacturer’s protocol (Calbiochem) for 3 hr at 37“C. Enzyme-treated 
samples were added with gel loading buffer containing 0.1% SDS and DTT and analyzed by 8% 
SDS-PAGE. . 

Both digested and undigested proteins were boiled in SDS and analysed by reducing SDS- 
:5 PAGE (Rgure 55). After a 1-hr pulse, the S protein was apparent as a single gpl70 coniponent 
that was Endo H sensitive Ganes 1 and 2). After a 2-hr chase, a new species (gpl80) was present 
along with gpl70 in approximately equal proportions Gane3). After a 4-hr chase, the gpl80 
species was the dominant S protein component Qane 5) that was now Rido H resistant Ganes 5 
and 6). Hus data is consistent with gpl70 being an ER-resident glycoprotein containing high 
0 mannose chains and with gpl80 corresponding with a Golgi-processed glycoprotein containing 
Endo H-resistant complex oligosaccharides. 

Tbe Endo H sensitivity of die C-terminus deleted SA protein purified from cell culture 
media ws also tested. As shown in Rgure 76, the SA observed within cell lysates was found to be 
Endo H sensitive Ganes 1 and 2), while the secreted SA in cell culture media was Endo H 
5 resistant Ganes 3 and 4). This result is consistent with this glycoprotein being synthesized in an 
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imiDature fonn in the ER pnor to transfer to the Golgi where the complex carbohydrate is added 
and die protein then secreted. 

As already described, the S protdm expressed in COST cells was detected as a 
gpl70/gpl80 doublet in western blot analyses of cell lysates that were fully denatured by boiling 
5 in the presence of DTT. However, the majority of S protein was detected as a high molecular 

glycoprotein in the 440-669 kDa range when the same cell lysate was not heat-denatured prior to 
western blot analysis using SDS-PAGE (Hg. 77, lane 1). Ihe ~500 kDa species was resistant to 
10 mM DTT treatmait (lane 3) and not dissociated into die monomeric form unless the lysate 
was first heat-denatured at 100“C (lane 4). In contrast, oligomeric form of a test protein 
10 (Thyroglobulin) of which quaternary structure is held by disulfide-linkage was converted into 
subunit form by the 10 mM DTT treatment These data suggest that the ~500 kDa oligomeric 
form of S protein is not disulfide-linked and is heat labile. To confirm the heat-sensitivity of the 
~500 kDa species of S protein, the heat-denaturadon experiment was repeated but without DTT. 
As shown in Rgure 78, heat denaturation of -500 kDa protein at 100°C alone was sufficient to 
15 convert it into gp 170/180 monomeric forms (lane 4). Using a 80°C heat-denaturation step, both 
the -500 kDa and monomeric forms were detectable in similar proportion fiane 3). 

hr order to investigate further whether this -500 kDa species represents an S protein 
oligomer in native conformation, comparative analyses with virion-derived S glycoprotein 
derived from Veto cell cultures was performed. The purified virions were solubilised in 1% SDS 
20 prior to Western blot analyses after SDS PAGE. The presence of the -500 kDa spike protein 
oligomer was confirmed in virion particles (Fig. 79, lane 1). In addition, heat denaturation of 
solubilised virions produced the same oligomer-to-monomer conversion as seen with the full- 
length recombinant S (lanes 2,3). The oligomeric nature of virion S was further analysed in a 
cross-linking experiment Aliquots of inactivated virion from sucrose gradient fractions were 
25 treated with 10% SDS at 1 % final concentration and diluted 2-fold with 0.2M Triethanolamine- 
Ha (pH 8, Sigma); Dimethyl suberimidate (DMS; Pierce Oiemical Co.) was then added fiom a 
freshly prepared solution (lOmg/ml in 0.2M Triethanolamine-Ha) at 3.3mg/ml final 
concentration. After 2 hr at room temperature, samples were concentrated with Centricon-30 and 
analyzed by silver staining aftm- electrophoresis on a 4% polyacrylamide gel. Both untreated and 
30 DMS cross-linked virion proteins were heat-daiatured, and the heat effect on the maintenance of 

oligomer stmcture was analysed by SDS-PAGE and silver staining (Figure 80). hi the absence of 
cross-linking, heat denaturation resulted in the replacement of the -500 kD spike protein species 
witii the monomer species, hi contrast, in the cross-linked proteins, the levels of the -500 kD and 
monomer species did not chan^ significantly after heating. These data support the fact that the 
35 -500 kD protein is an oligomer of S monomer proteins that are bound non-covalently. After 

cross-linking and boiling, the -500 kDa species migrated as a somewhat slower diffuse form 
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than the untreated fomt This mobility shift is probably due to a structural change resulting from 
boiling. In addition, a minor protein species of ~300 kD, which may represent a non-dissociated 
S dimer, could be seen. 

To estimate more precisely the size of the recombinant ~S00 kDa S species expressed in 
5 COS7 cells, a COS7 ceU lysate containing the S protein oligomer was fractionated using size- 
exclusion colunm chromatography. The major portion of the ~500 KDa oligomer co-eluted with a 
572 kDa marker protein. Taken together, these experiments suggest that the ~500 kDa S species 
seen in COS7 ceU lysates is probably a homotiimer of the S protein monomer. 

The oligomeric status of the SA spike protein was also examined after expression in COS7 
10 cells. As shown in Hgure 81, die recombinant SA proteins present in ceU lysates were also 

detected in high molecular weight forms of ~500 kDa range when the lysate was not heated prior 
to SDS-PAGE and Western blot analysis (lane 1). However, the efficiency of oligomerization by 
intraceUular SA protein appears to be much less (<10%) compared to that of fuU-length S protein 
under the same western analysis conditions. A heat-seositivity test on this -500 kDa protein 
15 showed diat the SA oligomer was more heat labile than that of the Ml-length 5 oligomer, as 

demonstrated by the >90% conversion of aU of the ~500 kDa species into monomeric Sd forms 
at 80“C Qane 2). Also (Figure 82), the majority of the secreted SA protein was found in 
monomeric form with the ~500 kDa species barely detectable (and only, detectable when {he 
protein was loaded in excess for Western analysis) (lane 1). At a temperature above 80®C, all 
20 secreted SA proteins were detected as monomers (lanes 2, 3). 

The ~500kDa protein is glycosylated, and the effect of deglycosylation on its antibody 
binding was investigated. The recombinant COS7 lysate was treated with PNGase F under non- 
denaturing condition (as described above) and analysed by western blot As shown in Figure 83, 
deglycosylation did not affect the binding of anti-histidine Mab antibody to the treated S 
25 oUgomer (lanes 2,3). However, it compromised the reactivity with the rabbit antisera raised 

against purified virus Oane 6). This antiserum binds to virion-derived S in western blot analyses 
only when DTT is omitted from the sample for SDS-PAGE indicating that it recognizes 
primarily a discontinuous, conformational epitope(s). Ibis antisera has also been shown to have 
a higb-titer of viral neutralizing antibodies. Its lack of binding to deglycosylated, recombinant S 
50 suggests that the carbohydrate actively contributes to the higher order, native stmcture of the S 
polypeptide oligomer. 

The difference between the recombinant S and S A protein is the presence or absence of the 
TM-and Cys-rich domains at the C-terminus. This difference predicts that fuU-length S would be 
found associated widi the membrane fraction while Sd would be in the soluble fraction upon 
55 lysis of transfected cells. Therefore, nSh- or nShATC-transfected cells were lysed under 
hypotonic conditions and the soluble cytosol fraction was separated fix>m the insoluble 
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membrane fraction by centrifugation (Figure 48). As shown in Hguie 84, the S protein was 
found in Ihe membrane firaction (DF) both as a -500 kDa and 180/170 HDa species (lane 4) but 
was not detectable in the soluble cytosol fraction (AF) (lane 3). However, the truncated SA 
protein was found as a monomeric species (gpl70) in both fractions (lanes 5,6). This indicates 
5 that die C-terminal TM and Cys-iich domains ate required for the anchorage of the S protein to 
cell membrane. 

The cellular location of the S and SA proteins in COS7 cells was analyzed by indirect 
immunofluorescence microscopy. At 48 hr post-transfection, cells were directly fixed with 2% 
paraformaldehyde without detergent for cell surface staining or treated with detergent followed 
10 by Cytofix/Cytoperm solution for intracellular staining. Hbred cells were then stained with rabbit 
anti-SARS sera (2BE) and FTTC-conjugated antibody. The nSh-transfected cells showed foci of 
S protein indicative of Gdlgi-localisation (Figure 85A), while the nShATC-transfected cells 
displayed a uniform distribution of SA protein throughout the cytoplasm indicative of ER 
localisation (Figure 85B). While the complete S protein was also observed on the surface of 
15 transfected cells in unfixed cells (Figure 85D), the SA was undetectable on the cell surface 

(Rguie 85E). These results indicate the role played by the TM-and (Tys-rich domains in 
anchoring the S protein to the plasma membrane. Although the TM-region alone is likely 
responsible for membrane anchorage, the potential role played by the Cys-rich region remains to . 
be determined. 

20 The SARS recombinant full-length S protein is thus an N-Unked glycoprotein with an 

estimated molecular weight of 170-180,000 kDa. Deglycosylation with PNGase F resulted in a ' 

polypeptide of the expected size for the uncleaved, encoded polypeptide (140kDa). Both 
transient and stable expression of the full-length SARS-CoV S gene in a variety of mammalian 
cells, including COS7, 293, BHE21, and Huh7 cell lines, consistently produced a S protein 
25 doublet (gp 170/180) as detected in western blot analyses. Pulse-chase analyses of transfected 

cells demonstrated that the SARS CoV S protein was initiaUy synthesized as an Endo H sensitive 
gpl70 species followed by the gradual appearance of an Endo H resistant gpl80 form, 
presumably as a result of the addition of complex carbohydrate within the Golgi apparatus. 

The recombinant S protein was not secreted into the cell culture medium unless the C- 
30 terminal 60 amino adds containing the TM-region and the Cys-rich tail wore deleted. 

The quaternary strocture of the full-length recombinant S protein was investigated using 
cross-linking treatment, heat-denaturation, and size fractionation analyses. The results data are 
consistent with the recombinant S protein existing as a homotrimer of ~500kDa. Similar analyses 
of virion-derived S yielded the same results. Such a trimeric structure has been reported for other 
35 enveloped RNA viruses: the hemagglutinin HA of influenza virus, the E1-E2 heterodimer of 

alphaviruses and the G protein of vesicular stomatitis virus. Incubations under reducing 

-322- 




wo 2004/092360 



PCT/US2004/011710 



conditions indicate that the SARS-CoV S trimeric structure is non-covalently associated, and is 
very stable. S oligomers present in the cell lysate were shown to be resistant to reduction by 10 
mM DTT, detragent treatment witfi 1% SDS, and heat denaturation at up to 60"C . Incubation at 
a temperature hi^er than >80°C resulted in the dissociation of the trimeric complex as 
5 evidenced by the decrease in tiimer with the concomitant increase in the monomer bands. The 
temperature-induced appearance of the high-mannosylated gpl70 (ER monomer form) as well as 
the complex-glycosylated gplSO (Golgi monomer form) suggests that trimerization can occur 
before the transport of the monomer spike protein to the medial Golgi apparatus. Ibis is 
consistent with other reports for TGEV, influenza virus HA, and vesicular stomatitis virus G 
10 proteins. With these proteins, trimerization was reported to take place before addition of complex ■ 

oligosaccharides in the Golgi apparatus. 

. The C-terminally teuncated form of S was found in the cell lysate in both oligomeric and 
monomraic forms at a frequency of 10% and 90%, respectively. The truncated protein secreted 
into medium was found fully glysosylated and it was essentially aU in monomeric form. We 
IS conclude that the C-terminal 60 amino acids of the S glycoprotein contains a membrane anchor '> 

regidn that affects the efficiency of trimerization. In S protein trimerization, it is possible that the 
C-teiminal region is required to initiate the event and the triple-stranded coiled cod structures in - 
the S2 stalk domain provide further stabilizing force as seen in HA oligomer of influenza virus. . 

EXAMPLE 24: CHO cells for Spike protein expression 
10 CHO cell lines that stably express either the full-length or truncated SARS-CoV spike 

proteins were prepared. Several stably transfected CHO cell lines were obtained, and Figure 73 ■ 
shows western blot data from a panel of representative clones. 

EXAMPLE 25; Expression in E.coti 

All SARS-CoV ORFs (Figure 17, Table 10) were cloned in the pET vector and expressed 
25 as C-terminal His-Tag fusion proteins in E.coli. The proteins smaller than 16KD were also 
expressed as N-terminal GST (Glutathione S-transferase) fusion proteins using pGEX vector. 

Nspl and Nsp2, the two SARS-CoV proteins with proteolytic activity, were not expressed 
as full length proteins due to toxicity in E.coli. The respective genes were instead cloned in 
different portions in order to separate: the catalytic residues (Cys833/ His994 for Nspl; 

30 HLs41/Cysl45 for Nsp2) in the resulting recombinant proteins: NsplA from nucleotides 2719- 

5214 of AY310120; NsplB from nucleotides 5218-7371; NsplC fix)m nucleotide 7372-9984; 
Nsp2A fiom nucleotide 9985-10416; Nsp2B from nucleotide 10476-10902. 

Nsp9 (SEQ ID NO: 9775) was divided into two portions: Nsp9A from nucleotide 
13371- 14756; Nsp9B from nucleotide 14757-16166. 
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Matrix (M), ORE’S and ORI*7 contain respectively three, two and one transmembrane 
domains. Hiese proteins wrae expressed as deletion proteins excluding the first 100 amino adds 
(M and ORE’S) or the first 18 amino adds (ORE’7) that include the hydrophobic regions. 

Ihe cloned sequences are shown in Table 26. 

5 A two-step strategy was used to amplify the cloned sequences. In the first step, 

amplification of DNA fiagments containing more dian one gene or single gene used sequenced 
cDNA as template. Eleven cDNA sequ^ces were amplified: (1) a fragment, named amplCl, 
including genes coding for protein E, protein M, orf 7-8-9-10; (2) a fragment, named amplC2, 
including genes coding for orf S-4; (S) a fragment, named amplCS, including genes coding for 
10 proteins Nspl2 and NsplS; (4) Nspl Igene; (5) 1*28 and P65 genes; (6) NsplB and NsplC genes 
portion; (7) a fragment, named amplC9, including genes coding for proteins Nsp2 and Nsp3; (8) 
a fragment, named amplNsp4-7, including genes coding for proteins Nsp4, Nsp5, Nsp6, Nsp7 
and for amplification of Nsp9A gene portion; (9) Nsp 9B gene portion and NsplO gene; (10) a • 
fragment, named amplCO, including genes coding for proteins Orfl 1, Nucleocapsid (N) and 
15 Orfl2; (11) NsplA gene portion. The primes used in this first step are given in Table 27: 

. In the second step, amplification of single genes was performed using DNA fragments 
fiom the first amplification st^ as templates. The primers are shown in Table 28. 

Of the proteins where expression was seen, it was either in inclusion bodies (insoluble) or • 
in a soluble form. I*urification proceeded on appropriate material. Table 29 shows the molecular 
iO weight of the expressed fragments of SARS-CoV ORFs, whether they were cloned (+ or -), 
whether the cloned fragment was seen to be expressed (+ or — ) and the form of protein which 
was chosen for purification. 

Where a protein was a soluble His-tagged product, a single colony was streaked and grown 
overnight at 37®C on a UB/Amp (100 /xghnl) agar plate. An isolated colony fiem this plate was 
. 25 inoculated into 20ml of LB/Amp (100 /ighnl) liquid medium and grown overnight at 37°C with 
shaking. The overnight culture was diluted 1:30 into 1.0 L LB/Amp (100 /ig/ml) liquid medium 
and allowed to grow at the optimal temperature (30 or 37®Q until the OD550nm reached 0.6- 
0.8. Expression of recombinant protein was induced by addition of IPTG (final concentration 1.0 
mM) and the culture incubated for a furdier 3 hours. Bacteria were harvested by centrifugation at 
30 8000 X g for 15 min at 4°C. The bacterial pellet was resuspended in 10 ml of cold buffer A (300 

mM NaCl, 50 mM phosphate buffer, 10 mM imidazole, pH 8.0). Cells were disrupted by 
sonication (or French I*ress) on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4®C. Supernatants were mixed with 150/21 Ni^-resin 
(previously equilibrated witii buffer A) and incubated at room temperature with gentle agitation 
35 for 30 min. The resin was Chelating Sepharose Fast I’low (Pharmacia), prepared according to the 
manufacturer’s protocol. The batch-wise preparation was centrifuged at 700 x g for 5 min at 4”C 
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and the supernatant discarded. The resin was washed twice (batch-wise) with 10ml buffer A for 
10 min, resuspended in 1.0 ml bu£f(^ A and loaded onto a disposable column. The resin 
continued to be washed with buffer A at 4°C until the OD280nm of the flow-through reached 
0.02-0.01. The resin was further washed with cold buffer B(300 mM NaQ, 50 mM phosphate 
5 buffer, 20 mM imidazole, pH 8.0) until the the OD280mn of the flow-through reached 0.02-0.01. 

The His-fusion protein was eluted by addition of 700^1 of cold elution buffla: C (300 mM NaQ, 
50mM phosphate buffer, 250 mM imidazole, pH 8.0) and fractions collected until the OD 280 nm 
indicated all the recombinant protein was obtained. 20/tl aliquots of each elution fraction were 
analyzed by SDS-PAGE. Protein concentrations were estimated using the Bradford assay. 

10 Where a protein was seen as an insoluble product, the inclusion bodies were purified as 

follows; homogenize cells (5 g wet weight) in 25 ml O.IM Tris HQ pH 7, ImM EDTA, at 4“C 
using an ultraturrax (10000 rpm); add 1.5mg lysozyme per gram cells; mix shortly with an 
ultraturrax and incubate at 4“C for 30’; use sonication or high-pressure homogenization to 
disrupt the cells; to digest DNA, add MgCh to a final concentration of 3mM and DNase to a 
15 final concentration of lOug/ml and incubate 30’ at 25°C. add 0.5 vol of 60mM EDTA;^6% Triton 
x-100, 1.5M NaCl pH 7.0 to the solution, and incubate for 30’ at 4°C; centrifugation at 31000 g 
for 10’ at 4‘’C; re-suspend pellet m 40ml of 0;1M Tris HQ pH 7.0, 20 mM EDTA using 
ulttaturrax; centrifugation at 31000 g for 10’ a 4°C; store the IB pellet at - 20°C. 

The results of expression are shown in Figures 86 to 105. Examples of purity and yield are 
10 given in Table 30. 

EXAMPLE 26: Retention of critical epitope on truncated Spike antigen 

A human monoclonal antibody having neutralizing activity was tested in an ELISA assay 
for reactivity with the purified truncated Spike protein. Briefly, ELISA plates were coated with 
truncated form of the spike protein at a concentration of 1/ig/ml (100/il/ well) by incubating the 
15 plates ovauight at 4°C. The plates were washed, non-specific binding sites were blocked and 
then different dilutions of the antibody w«e added and plates were incubated for 1 hour at room 
temperature. At the end of incubation, the plates were washed and bound antibody was detected 
by using anti-human IgG conjugated to horse radish peroxidase (HRP) and an appropriate 
substrate. The optical density of each well was recorded at 405 nm'using an EUBA reader. The 
50 data are shown in Figure 69 and clearly demonstrate that the neutralizing epitope recognized by 

the mAb is preserved and exposed on the recombinant truncated Spike protein. 

EXAMPLE 27: D^erent Spike vaccines 

Purified truncated spike protein was used to i mmuniz e tnice and the level of binding 
antibodies induced against the truncated spike protdn was determined by ELISA assay. Briefly a 
55 group of 10 mice were immunized with 3/tg of truncated spike protein adjuvanted in MF59 at 0, 
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4 and 8 weeks intervals. Sraa samples were collected from these atiimalB and assayed for 
antibodies induced by truncated spike protein in an. ELISA assay. An additional group of 8 mice 
was i mmunize d with 75 ng of DNA ^coding the truncated form of the spike protein on PLG 
particles at 0, 4 and 13 weeks intervals, the sera were collected and analyzed as above for anti- 
5 spike antibodies as above 

Tlie profile of binding antibodies induced in each group was plotted as geometric mean 
titer (GMT). Compared to a plasmid DNA vaccine expressing truncated spike antigen and 
delivered using a PLG microparticle formulation, the purified truncated spike protein was 
significantly more potent for inducing strong antibody responses. Further comparison with the 
10 antibody responses induced by inactivated BPL-SARS-CoV (already shown protective) in the 
same mouse strain revealed that the magnitude of antibody responses induced by purified 
truncated spike protein and the inactivated virus vaccine are in the samp, range (Hgure 70). 

Hie neutralization potential of antibodies induced by the recombinant truncated spike 
protein, or plasmid DNA expressing the same spike antigen, were also evaluated. The GMT 
5 values obtained in both groups are showii in Figure 71. From these data, it appears that the 
purified protein is significantly more effective at inducing neutr alizing antibody responses 
against the SARS-CoV spike. Furthermore, the neutralization titers typically induced by the 
purified truncated spike protein are comparable to neutralization titras induced by an inactivated 
SARS-CoV vaccine. 

0 Figure 72 shows a comparison of antibody binding levels (EEJSA, X-axis) with 

neutralization titers (Y-axis). In general there is a very good correlation between the binding and 
neutralizmg antibodies. The bottom-left grouping shows ratios 2 weeks post-3rd immunization 
with the DNA vaccine; the top-right grouping shows ratios 2 weeks post-2nd immunization with 
the protein vaccine. Both forms of vaccine show a consistent correlation. 

5 In further experiments, the ability of a DNA vaccine to invoke an immune response in 

mice was studied. Mice were immunized with pCMV-nSdTC plasmid, either free or with PLG 
microp^cles. Serum from the mice was then used as the staining antibody against cultured 293 
cells that had been transfected with spike, either full-length or truncated. The cells were 
centrifuged prior to testing and the pellet was lysed. The antibody was tested against die culture 
3 supernatant and against the ceU lysate. As shown in Kgure 112, the mouse serum was able to 
detect spike protein in the lysate of cells that expressed full-length spike and in the supernatant 
of cells that expressed the truncated spike protein. Results were comparable to the staining seen 
when using rabbit serum that had been obtained after immunization with whole killed virus. 

Hius anti-spike antibodies can be induced by the use of DNA vaccination. 
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EXAMPLE 28: Expression cassettes in pCMV 

The sequence of plasmid pCMVKm2 is given as SEQ ID NO: 9923. Genes encoding the 
spike protein either in fuU-length form (pCMVKm2 SAKS Spike nS; SEQ ID NO: 9921) or in its 
ATC fonn (pCMVKm2 S ARS Spike nSATC; SEQ ID NO: 9922) were inserted into this basic 
5 vector. 

Mice were immunized with these vectors, and with similar vectors encoding the N, M or E 
proteins. Vectors encoding the same proteins but with optimized codon usage were also 
prepared. Codons wctb optimized for efficient human expression starting from the FRA 
sequence (GenBank: AY3 10120). The optimized sequences are: N (SEQ ID NO: 9924); M (SEQ 
10 ID NO: 9925); E (SEQ ID NO: 9926). 

After administration, expression of proteins could be detected by immimofluorescence in 
aU cases. For ex^ple. Figure 106 shows immunofluorescence (using anti-SARS rabbit serum) 
results after administration of the vector encoding optimsed N antigen, revealing high level 
expression. Mice receiving the control vector alone showed no fluorescence. 

15 ■ Figure 107 compares immunofluorescence (using Abgent anti-M antibody) of the native M 

sequence (107 A) or the codon-optimsed M sequence (107B). Similarly, Figure 108 compares 
immunofluorescence (using Abgent anti-E antibody) of the native E sequence (108A) or the 
codon-optimsed E sequence (108B). , 

Four groups of mice (8 mice per group) were immunized with: (1) SARS nS Spike, nSdTG 
iO truncated Spike, and N proteins; (2) pCMV-SARS-nSdTC: DNA+DNA-PLG at weeks 0,4 and 
13 wks; (3) CMV-nS: DNA+DNA-PLG+VEE/SIN Rep at 0, 4 and 9 wks; (4) VEE/SIN Rep- 
SARS-nS three times at 0, 4 and 13 wks. Sera from all groups recognized SARS nS and nSdTC 
proteins, and also showed virus binding and neutralization activity. 

EXAMPLE 29: Spike protein cleavage 

J5 To investigate the effect of proteolytic cleavage on SARS-CoV Spike protein, it was 

expressed in various forms in Kcoli, including; (1) full-length S1-S2; (2) SI alone; (3) HRl 
heptad; and (4) HR2 heptad. The expressed proteins were used to raise immnnp. rabbit sera, 
which were then used for visualizing western blots of Vero cells, either infected or not infected 
with SARS-CoV. 

to Figure 109 shows a western blot using a 1: 10000 dilution of antibody raised against either 

the SI domain or the uncleaved S1-S2 domains. Figure 110 shows a western blot using a 
1:10000 dilution of antibody raised against each of the four proteins. Ibe difference in antigen 
reactivity is readily apparent 

Figure 111 shows similar data. Each serum was tested against four lanes, with those gour 
!5 lanes being from left to right (a) serum at 1 :500 dilution, S ARS-CoV-infected cells; (b) serum at 



- 327 - 




wo 2004/092360 



PCT/US2004/011710 



1:500 dilution, non-iofected cells; (c) senim at 1:2500 dilution, SARS-CoV-infected cells; (d) 
serum at 1:2500 dilution, non-infected cells. Again, the difference in antigen reactivity is readily 
apparent 

Kgures 109-111 show diat the Spike protein exists in various forms in infected Vero cells, 
5 with sizes of approx. 75kDa, 90kDa, ISOkDa and >250kDa. TTie Spike protein is cleaved (at 
least partially) eidier intraceUulary or after release of the particles. 

If enzymatic cleavage of the mouse hepatitis coronavirus spike protein is inhibited then 
cell-cell fusion (syncytia formation) is also inhibited, but virus-cell fusion is not (de Haan et al 
(2004) J Virol). Syncytia are observed in vivo in the lungs of SARS-infected patients, but are not 
10 seen in Vero ceD cultures of the SARS-CoV. Inhibition of Spike protein cleavage may thus be 
used to prevent syncytia formation and related pathology, even though viral infectivity may not . 
be blocked. •: 

Example 30; Purification of SAKS protease 

Cells were grown at 37°C to mid-log phase and induced with 0.2% L-arabinose. Cells w«e 
15 harvested by centrifugation, and the cells resuspended in lysis buffer (LB) containing 20 mM 
Tris pH 7.5, 500 mM NaCl, 5% glycerol V/V, 0.05% Triton X-100, 5 mM PME, 5 mM 
imidazole, and'complete protease inhibitors (— )EDTA. Benzonase was added to a final- 
concentration of 50U/ml of lysate. Cells were then lysed using two passes through a pre-chilled 
microfluidizer. The lysate was clarified by high speed centrifugation at 44,000 x g. Clarified 
10 lysate was applied to a prepared Pharmacia chelating EF column charged with nickel sulfate. 
After application of the lysate the col umn was washed with 5 column volumes of LB, followed 
by 5 column volumes of LB supplemented with 45 mM imidazole. The column was then eluted 
using LB supplemented with 250 mM imidazole. Purity of the isolated SARS protease was 50%. 
Fractions containing protease were pooled, adjusted to 5 mM EDTA, and then applied to a 
15 Superdex 200 gel filtration column equilibrated in 20 mM Tris pH 7.5, 150 mM NaCl, 5% V/V 
glycerol, 0.05 % Triton X-100, and 5 mM DTT. Purity of the isolated SARS protease was 70%. 
Again, fractions containing the protease were pooled, and then stored at -80°C until used. 
Activity assay, mass spectrometry and western blot analysis were used to positively identify the 
protein (FIG 133). All steps were carried out with pre-chilled buffers, and kept at 4°C for as - 
30 much of the preparation as possible. 

Western of SARS Protease Purification Ructions 

Protocol: Briefly, protein concentration was based on Absorbance at 280 nm, and coomassie 
stained gel estimates of purity. Protein was run on a 4-20% gradient gel, and transferred to 
nitrocellulose. The blot was then blocked with 3% BSA, probed with Mouse IgG and-pentaHis, 
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and then probed with a secondary antibody to Mouse IgG conjugated with HRP. The blot was 
visualized using an ECL kit CPhaimacia Biotech). The results are shown in Figure 133 where A 
is the sizing column pool loaded at SO, 100 and 200 ng of target protein and B is the inounobilized 
metal afBnity column pool loaded at SO, 100 and 200 ng of target protein. 

S 

Example 31: Continuous Fluorescence Resonance Energy Transfer (FRET) Enzyme Assay 
The peptide containing EDANS, the fluorescence donor, and DABCYL, the fluorescence 
quencher (DABCyi/-WViS7I/j2 ^GTdRiT-EDANS) was synthesized by Syn. Pep. (Dublin, CA). 
The ^ptide contains the cleavage site Gln-Ser in the middle. Meyers, G. et al. Handbook of 
10 Proteolytic Enzymes and Barrett, A et aL Academic Press, London, 1998, 726-728. The 
proteolytic activity of SARS protease was followed kinetically by measuring the level of 
formation of cleaved product that contains the fluorescence donor, SGLRK-EDANS using the 
Hitachi fluorometer (F-4S00 FL Spec.) set at 340 nm excitation and 490 run emission wave 
length. 5 pL of 5 mM peptide stock in DMSO solution was ^ded to the reaction mixture, 
15 containing 295 pi of standard buffer (75 mM Tris-Hcl, 25 mM NaOAc, 25 mM Bis-Tris, 25 mM 
glycine, 5 mM EDTA, and 1 mM EDTA, pH 7.4) and 100 ul of buffer or 100 ul of 3.6 uM 
protease stock solution. The kinetic curve was followed for 6 minutes (the reaction was linear 
with R2 value of 0.998 (FIG 134)). The formation of fluorescence (proteolytic reaction) is likely 
enzyme dependent, as concentration of enzyme was tripled three times as much fluorescence was 
20 formed in the 6 minutes time flame. 

It will be understood that the invention has been described by way of example only and 
modifications may be made whilst remaining within the scope and spirit of the invention. 



25 
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8189 



1485 



1485 



1495 



1^31 



1497 



13155 



14981 



1496 



1497 



20033 



1498 



3650 



9365 



25645 



3647 



25398 



9256 



18712 



25398 



18220 



18223 



25398 



18231 



18233 



18233 



18220 



3846 



5306 



18223 



18712 



13325 



9249 



24740 



1^ 



9340 



24936 



9249 



26753 



26753 



18223 



25080 



9109 



25086 



17035 



17035 



10166 



13305 



10336 



3164 



19895 



8170 



1465 



1466 



1474 



18209 



1476 



13137 



1478 



1478 



1480 



20016 



1481 



3631 



93471 



25626 



3628 



253781 



9237 



18693 



25378 



18202 



18206 



25378 



18210 



18214 



18215 



18201 



36251 



5288 



1^ 



18693 



13305 



9231 



247171 



18209 



9319 i 



24919 I 



9230 ! 



26733 



26734 I 



18206 i 



25062 ! 



9087 



250691 > 



51.8| 45.51 



51.8 45.5 



0.11 495 



0.8 495 





-366r 
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mm 






1445 



1446 



14472467 



1448 



1449 2469 



14502470 



1451 2471 



14522472 



14532473 



1454 



1455 



145612476 



1457 2477 



1458 2478 



1459 2479 



1460 



1461 2481 



14^2482 



1463 



1464I2484 



14652485 



1466 



146712487 



1468 



1469 



1470 



1471 2491 



14722492 



1473 



147412494 



1475 



1476 



1477 2497 



1478 



147912499 



1480 2500 



1481 2501 



1482 2502 



1483 2503 



1484 2504 



1485 2505 



1486 ^ 



1487 2507 



1488 2508 



1489 2509 



1490 2510 



1491 2511 



1492 2512 



1493 2513 



1494 2514 



1495 2515 



IBII 



18011 



24481 



24481 



24482 



244^ 



24482 



mu 



18011 



18011 



18011 



24419 



18008 



24420 



12232 



4644 



18009 



18010 



24418 



24418 



9351 



1801 II 



13176 



3229 



25772 



3229 



13176 



25771 



24418 



25769 



18009 



18009 



24418 



18009 



24418 



25363 



25363 



25354 



18005 



18005 



25354 



18005 



18003 



13176 






24502 



24502 



24502 



24502 



24502 



18030 



1^ 



18030 



18030 



24440 



18029 



24440 



12250 



4^ 



1^ 



18029 



24439 



24439 



9370 



18029 



13196 



3248 



25793 



3248 



13196 



25790 



24436 



25786 



18028 



18028 



24436 



18028 



24436 



25381 



25381 



25372 



18024 









55 



45.5 



45.514967 



42.9 



42.9 



42.9 



55 4971 



55 4972 



55 4973 



55 4974 



45.5 4975 



50 4976 



42.94977 



52.64978 



45.54979 



52.4 



50 4981 



45.5 



45.5 



50 



52.6 



47.6 



50 4987 



40.9 



50 



47.6 



4514991 



47.4 



50 



55 



55 



47.4 



5514997. 






]E1SII 



lEBSEI 




SI 



gggili 



52.6 



50 



50 



52.6 



50 



52.4 



47.6 



40.9 



52.6 



45.5 5010 



47.6 5011 



45.5 5012 



45 









TiT*! 



18220 



24815 



2^)81 



2S)82 



25085 



25086 



18223 



18231 



18233 



18233 



24815 



18220 



24527 



12994 



5306 



18712 



18223 



24527 



248151 



10017 



18229 



13314 



3497 



26182 



3500 



13323 



26183 



24526 



26182 



18231 



18233 



25082 



1^ 



25209 



25650 



^651 



25548 



18223 



18231 



25651 



18233 



18712 



13326 



9311 



25832 



9109 



10017 



8933 



18632 









18202 



24792 



2S)63 



25064 



25(^ 



25069 



18206 



18210 



1^4 



18215 



24792 



18201 



24507 



12976 



5288 



18693 



18205 



24507 



24792 



9999 



18209 



13297 



3478 



26161 



3481 



13304 



26163 



24506 



26161 



18210 



1^14 



25064 



18215 



25190 



25631 



25634 



25531 



18206 



18210 



25632 



18215 



18693 



13306 



9292 



25811 



9087 



9999 



8916 



18611 
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1496IS16 



1497 2517 



1488 2518 



1499 2519 



1500 



1501 2521 



1502 



1503 



1504 



15(»2525 



1506 2526 



15072527 



1508 2528 



1509 2529 



15102530 



1511 2531 



1512 2532 



1513 



1514 



1515 



1516 2536 



2537 



2538 



2539 



mm i 

^31 






1521 2541 



1522 2542 



1523 2543 



15242544 



1525 2545 



1526 2546 



1527 2547 



1528 2548 



2549 



2550 



1531 2551 



15^ 2552 



1533 2553 



1534 2554 



1535 2555 



1536 2556 



1537 2557 



1538 2558 



1539 2559 



1540 



1541 2561 



1542 



1543 



1544 



1545 



1546 






^1 



51.2|47.4|5016 



50.9 50 5017 



51.6 40.95018 



5121 47.4 5019 



512 47.4 5020 



52 52.6 5021 



50.4 40.9 5022 



50 42.9 5023 



50 42.9 5024 



50 42.9 5025 



50 42.9 5026 



50 42.9 5027 



50.3 40 5028 



50.4 50 5029 



50.3 50 



51.3 47.4 



51 45 



51.6 40.9 



51.4 45.5 



50.3 40 



50 42.9 



50 42.915037 



50.9 



50.9 



50 42.9 



50 42.915041 



50 42.9 5042 



522 526 5043 



522 5265044 



523 40.95045 



525 47.6 



50.9 4295047 



50.4 40.9 5048 



50.9 429 5049 



51.8 50 5050 



51.4 40.9 5051 



51.6 45 



51.1 526 



50.2 4515054 



51.6 429 



51.9 52615056 



45 5057 



50 



45 5059 



55 



521 55 5061 



50.5 47.4 



51.7 526 



50.5 526 



50.3 505065 



50.1 40 5066 












































































































































































































































































































































































































































































































































































































































































































































































Mil 

Mil 

Mil 



1^1 



1547 



1548 



1549 



1550 ^ 



1551 2571 



15522572 



1553 2573 



1S4 2574 



1555 2575 



1556:^6 



1557 2577 



1558 2578 



1559 2579 



1560 



156112581 



15^ 



1563 



1564 



1585 



15662586 



1567 2587 



1568 



1569 2589 



570 2590 

571 2591 



1572 2592 



1573 



15742594 



157512595 



^96 



1577 2597 



1578 2598 



15792599 



15«) 



1581 12601 



1582 



1583 



1584 



1585 



1586 



158712607 



1588 



1589 



1590 2810 



1591 2611 



1592 2612 



1593 2613 



1594 2614 



1595 2615 



15962616 



1597 2617 



immi 

IMII 

Mil 

^31 

1 1 



29182 



23841 



3404 



29183 



23841 



23838 



23838 



23838 



^181 



29181 



29180 



985 



985 



8859 



29178 



8859 



16909 



8794 



8794 



985 



180171 



4593 



18017 



6155 



6158 



3232 



3232 



28523 



28965 



8866 



28518 



6165 



6264 



18074 



6271 



18074 



6274 



18074 



5 



6343 



3800 



7615 



7723 



1 



7725 



18074 



3168 



18074 



13177 



28190 



28190 



Ml 

Ml 

mm 



43^ 5067 



52.6 



47.4 



40.9|5070 



52.65071 



50 5072 



50 5073 



50 5074 



47.65075 



47.6 5076 



4^95077 



50 5078 



50 5079 



42.9 



42.9)5081 



42.9 



45 



45 



45 



.50 



5515087 



47.6 



555089 



50 



42.95091 



50 



50 



40.9 



55 



45 



42.915097 



52.6 



50 



45 5100 



47.65101 



45 5102 



45 



45 5104 



^65105 



45.55106 



4E95107 



47.65108 



52.65109 



52.6 5110 



50 5111 



42.9 5112 



45.55113 



42.95114 



42.95115 



55 5116 



55 5117 



Bl 



Mil 

Mil 

Mil 

mm i 

Mil 



^358 



24498 



3646 



^14 



24527 



24093 



24496 



24498 



^358 



»414 



29358 



1498 



1497 



9254 



29414 



9340 



17109 



8919 



8920 



1496 



18223 



4994 



1^ 



6486 



6486 



3500 



3497 



29298 



29358 



9256 



28672 



6486 



6483 



18233 



6483 



1^1 



6483 



18223 



314 



6486 



4445 



7821 



8049 



314 



7856 



18233 



3503 



18231 



13312 



28671 



2^ 



29339 



24479 



3825 



293% 



24508 



24075 



24478 



24479 



29339 



29395 



29339 



1481 



1480 



9236 



29395 



9319 



17089 



8901 



8902 



1478 



18205 



4974 



18201 



6467 



6467 



3481 



3478 



29279 



29339 



9237 



28654 



6467 



6463 



18215 



6463 



18210 



6463 



18206 



296 



6467 



4425 



7798 



8032 



296 



7836 



18215 



3484 



18210 



13294 



28652 



28654 
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wo 2004/092360 
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28190 


2 ^ 


51.7 


526 5118 


2^1 


28652 


52.8 


55 


1 


A62 


79.9 


52.1 


1599 2619 


28190 


28208 


51.7 


526 5119 


2^1 


28653 


502 


52.6 


15 


482 


79.9 


521 


1600 2 ^ 


18074 


16094 


51.1 


42.9 5120 


18223 


18206 


51.8 


50 


0.7 


150 


732 


423 




28185 


28205 


53.5 


47.6 5121 


28284 


2 ^ 


1 ^ 


50 


0.6 


100 


74.5 


52 




28187 


28205 


53.1 


526 5122 


28672 


28653 


51.8 


55 


1.2 


486 


79.9 


522 


1603 2623 


2371 


2389 


50.3 


47.4 5123 


2900 


2881 


50.1 


45 


0.2 


530 


76.8 


44.3 


1604 2624 


2371 


2389 


50.3 


47.4 5124 


3052 


3033 


50.3 


50 


KOI 


%2 


76.7 


4 a 4 


1605 2625 


2371 


2389 


50.3 


47.4 51 ^ 


3056 


3038 


50.8 


52.6 


0.5 


^6 


76.7 


4 a 4 


1606 2626 


18074 


1«)95 


522 


45.5 5126 


18223 


18205 


53.3 


5 Z 6 


1.1 


150 


732 


4 a 3 


1607 2627 


2220 


2239 


512 


45 5127 


2891 


W73 


50.8 


47.4 


0.5 


672 


76.8 




1608 2^8 


18077 


18097 


515 


47.6 5128 


18662 


18641 


50.4 


40.9 


1.1 


586 




1609 2629 


28117 


28135 


50.6 


526 5129 


28671 


28653 


502 


52.5 


0.4 


555 




51.9 


1610 2630 


28116 


^134 


50.8 


47.4 5130 


28671 


28653 


502 


52.6 


0.6 


556 


79.9 


51.8 


1611 2631 


12232 


12250 


51.9 


526 5131 


13000 


12981 


51.1 


45 


0.8 


769 


76.5 


42.5 




18080 


18098 


512 


52.6 5132 


18702 


18685 


502 


50 


1 


623 


762 


42.4 


1613 2633 


12232 


12250 


51.9 


52.6 5133 


1 ^ 


12980 


50.6 


40 


1.4 


768 


76.4 


424 


1614 2634 


28820 


28840 


54.8 


47.6 5134 


29306 


^85 


56.7 


54.5 


1.9 


487 


77.1 


45.4 




28820 


28840 


54.8 


47.6 5135 


23306 


29287 


54.6 


55 


0.3 


487 


77.1 


45.4 




180 ^ 


18098 


512 


52.6 5136 


18642 


18622 


50.5 


42.9 


0.7 


563 


762 


426 




8865 


8884 


50.4 


45 5137 


9249 


9231 


50.8 


47.4 


0.4 


385 


75.1 


41.3 




8865 


8884 


50.4 


45 5138 


9249 


9230 


51.5 


45 


1.1 


385 


75.1 


41.3 




8865 


8884 


50.4 


45 5139 


9109 


9087 


50.5 


43.5 


0.1 


245 


73.9 


‘ 40.8 


1^2640 


28819 


28839 


56.6 


52.45140 


29306 


29285 


56.7 


54.5 


KOI 


488 


772 


• 45.5 


1621 2641 


9130 


9151 


52 


40.9 5141 


9364 


9346 


53.9 


W^\ 


KOI 


235 


74.8 


424 


1622 2642 


28820 


28839 


54.3 


50 5142 


29306 


29287 


54.6 


mm 


0.3 


487 


77 .f 


45.4 


1623 2643 


8865 


8884 


50.4 


45 5143 


9248 


9229 


50.1 


45 


0.3 


384 


75 


41.1 


1624 2644 


18080 


18098 


512 


526 5144 


18229 


18209 


50.1 


42.9 


1.1 


150 


732 


■ 43.3 


1625 2645 


15752 


15772 


50.8 


47.6 5145 


16175 


16155 


51.8 


47.6 


1 


424 


75.1 


41 


1626 2646 


12232 


12250 


51.9 


526 5146 


12498 


12480 


50 


47.4 


1.9 


267 


74.7 


42.3 


1627 2647 


18078 


18098 


51.5 


47.6 5147 


18223 


18205 


53.3 


52.6 


1.8 


146 


73 


432 


1628 2648 


7833 


7853 


50.7 


47.6 5148 


8054 


8035 


50.4 


50 


0.2 


222 


74.6 


432 


1629 2649 


230 


248 


512 


52.6 5149 


713 


695 


50.7 


47.4 


0.5 


484 


79.3 


50.6 


1630 2650 


1472 


1491 


512 


45 5150 


2153 


2134 


50.4 


45 


0.8 


682 


76.5 


42.8 


1631 2 K 1 


18076 


18098 


54.4 




1 ^ 


1^1 


56.1 


55 


1.8 


145 


73.1 


43.4 




1442 


1461 


51.6 


55 5152 


1694 


1673 


51.7 


40.9 


0.1 


253 


75.9 


45.5 


1633|2653 


28618 


28636 


52.5 


52.6 5153 


29 ^ 


29339 


52.8 


50 


0.3 


741 


782 


47 




940 


959 


5 a 3 




1701 


1677 


54.7 


40 


1.6 


762 


77.1 


442 




18076 


18097 


53.1 


45.5 5155 


18696 


18672 


53.9 


40 


0.8 


%i 


762 


424 




940 


959 


56.3 


55 5156 


1697 


1673 


54.4 


40 


1.9 


758 


772 


44.3 


163712657 


3016 


3036 


50.2 


429 5157 


3188 


3167 


502 


40.9 


0.1 


173 


74.5 


45.1 




18077 


18097 


512 


47.6 5158 


18696 


18673 


53.4 


41.7 


1.9 


620 


762 


424 


1639 2659 


18077 


18097 


51.5 


47.6 5159 


18697 


18679 


51.9 


52.6 


0.3 


%i 


762 


425 


1640 2 ^ 


9352 


9371 


50.6 


45 5160 


99 ^ 


99 % 


51 


40.9 


0.4 


638 


75.4 


40.3 


1641 2661 


6042 


6062 


50.4 


47.65161 


6374 


6353 


50 


40.9 


0.3 


333 


74.6 


40.8 


1642 2662 


942 


960 


521 


52.65162 


1697 


1678 


50.3 


45 


1.8 


756 


772 


44.3 




942 


960 


521 


52.65163 


1697 


1677 


51 


42.9 


1.1 


756 


772 


442 


1644 2664 


6042 


Boez 


50.4 






6273 


50.8 


45 


0.4 


251 


73.9 


40.6 


1645 2665 


942 


960 


521 




1697 


1676 


51.7 


40.9 


0.5 


756 


772 


44.3 




942 


960 


K .1 


5265166 


1694 


1673 


51.7 


40.9 


0.4 


753 


772 


44.4 


1647 2667 


13176 


13196 


51.4 


47.6 51 W 


13749 


13727 


50.5 


435 


0.9 


574 


762 


42.7 


1648 2668 


13176 


13198 


51.4 


47.6 51 ^ 


13949 


139 % 


51.6 


50 


0.2 


774 


75.9 


41.1 




m 



m 

m 










m 









56.5 


56.1 


51.6 


53.1 


56.6 


53.9 


53.9 


53.9 


52 


54.1 


53.6 


56.1 


56.1 


54 


53.5 


53.8 


55.5 


55.5 


53.6 


52.8 


52:8 


51.9 


56.1 


53.1 


• 55.4 


52.7 


51.4 


52.9 


524 


51.6 


52.4 


55,8 


53.7 


526 


53.7 


55.6 


55.5 


54.4 


55.5 


52.3 


53.9 


53.9 


53 


52.3 


542 


IBS 




54.6 


54.7 


53.6 



53.6 
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2447 



27455 



2447 



2445 



27407 



2445 



18636 



2395 



18611 



12384 



2395 



17001 



2395 



18608 



2395 



8240 



12388 



2244 



2239 



24439 



18603 



2239 



1422 



1375 




0.51 165 



0.1 1 473 



777 



631 



0.2 231 



1.3 629 



0.5 627 



1.6 234 



0.5| 151 



154 



1.9 628 



05 762 



1 106 



1.7 764 



0 630 



1.1 153 



1.4 764 



1.4 701 



0.8 296 



1.4 626 



0.7 374 



1.5 296 



0.1 130 



2 



0.3 



1.61 515 



0.4 716 



0.1 629 



1 453 



0.4 453 



1.1 519 



0.2 631 



0.8 456 



0.2 752 



1.7 798 





Forward primer SEQ 
ID NO & Co-ordinates 



Reverse primer SEQ 
ID NO & Co-ordinates 



Tm(FOR&REV)(°C) 
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1157-1175 



1420-1441 



1685-1707 



1740-1764 



2007-2025 



2226-2245 



2428-2446 



2742-2763 



2823-2844 



3007-3031 



3234-3254 



3453-3476 



3601-3622 



4007-4027 



4141-4165 



43604387 

4488-4508 



4658-4677 



4902-4922 



5239-5260 



53605389 



5593-5612 



. 6042-6062 



1479-1460 



1700-1680 



1834-1811 



1987-1963 



2251-2232 



2385-2366 



2749-2728 



2893-2875 



3082-3058 



3185-3164 



3497-3478 



3647-3627 



4158-4135 



4316-4295 



4567-4544 



4708-4690 



4994-4974 



5115-5092 



5450-5430 



5860-5836 




6106 




6201 


6483-6463 


51.1 


50.2 


213 


6107 


7017-7040 


6202 


7171-7153 


52.4 


52.8 


155 


6108 


7253-7272 


6203 


7504-7486 


50.3 


50.3 


252 




7415-7434 


6204 


7677-7654 


54.5 


53.6 


263 


6110 


7615-7635 


6205 


7821-7798 


51.1 


52.8 


207 


6111 


7728-7746 


6206 


7936-7915 


51.7 


50.1 


209 


6112 


7845-7867 


6207 


7994-7970 


52.7 


53.4 


150 


6113 


8011-8029 


6208 


8189-8170 


51.4 


50.6 


179 


6114 


8143-8166 


6209 


8300-8281 


52.2 


50.8 


158 


6115 


8221-8239 


6210 


8388-8369 


51.0 


51.1 


168 


6116 


8553-8575 


6211 


8931-8915 


51.8 


50.3 


379 


6117 


8867-8886 


6212 


9254-9236 


50.7 


50.6 


388 
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6118 


9244-9267 


6213 


9597-9573 


51.9 


53.4 


6119 


9620-9640 


6214 


9990-9969 


51.3 


51.3 


6120 


10009-10027 


6215 


10188-10171 


50.2 


50.2 


6121 


10093-10113 




10244-10223 


52.4 


50.6 


6122 


10242-10265 




10608-10589 


51.2 


51.0 




10549-10571 


6218 


1 0783-1 07d3 


53.7 


55.2 




10766-10785 


6219 


10930-10912 


52.0 


51.1 




11065-11085 


6220 


11305-11287 


50.7 


50.0 




11265-11287 


6221 


11429-11405 


54.5 


53.5 


6127 


11552-11571 


6222 


11730-11709 


52.0 


50.4 


6128 


11705-11726 


6223 


11869-11848 


50.1 


50.2 


1 ^^ 


11801-11824 


^1 


11984-11967 




50.4 


6130 


12040-12058 


■ 6225 


12254-12235 


52.3 


51.9 


6131 


12235-12253 


6226 


12406-12388 


50.1: 


50.1 


6132 


12366-12384 


6227 




51.7 


52.2 


.6133 


12727-12748 




12994-12976 


50.8 


50.3 


6134 


12948-12966 




13224-13201 


50.7 


51.7 


6135 


, 13175-13196 


6230 


13324-13300 


54.3 


55.1 


6136 


13237-13258 


6231 


13545-13526 


52.9 


52.9 


6137 


13790-13810 


6232 


13963-13945 


50.9 


50.7 


6138 


14080-14098 


6233 


14280-14257 


51.5 


51.0 


.6139 


14405-14427 


6234 


14561-14540 


50.2 


50.9 


6140 


14882-14906 


6235 


15046-15024 


50.9 


51.5 


6141 


14951-14976 


6236 


15145-15124 


53.1 


52.9 


6142 


15113-15134 


6237 


15275-15257 


51.6 


50.8 


6143 


15211-15230 


6238 


15383-15363 


50.2 


50.1 


6144 


15364-15387 


6239 


15528-15506 


54.0 


52.1 


6145 


15456-15477 


6240 


15605-15585 




53.2 


6146 


15513-15532 


6241 


15897-15876 




50.4 


6147 


15837-15856 


6242 


15999-15978 


52.3 


50.8 


6148 




6243 


16301-16277 


51.7 


52.8 


6149 


16245-16266 


6244 


16404-16380 


50.3 


52.0 


6150 


16366-16385 


6245 


16515-16492 


52.9 


53.8 


6151 


16553-16571 


6246 


16777-16758 


53.4 


51.5 


6152 


16832-16852 


6247 


17026-17004 


51.0 


51.6 
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6153 


16982-17001 


6248 


17359-17340 


51^ 


50.2 


378 


6154 


17354-17372 


6249 


17511-17490 


51.3 


50.4 


158 


6155 


17422-17443 


6250 


17573-17552 


50.2 


51.1 


152 


6156 


17603-17623 


6251 


17769-17748 


50.7 


51.5 


167 


6157 


17728-17746 


6252 


17883-17862 


50.9 


51.2 


156 


6158 


18011-18030 


6253 


18163-18140 


52.9 


51.9 


153 


6159 


18076-18098 


6254 


18225-18205 


54.4 


55.0 


150 


6160 


18270-18292 


6255 


18432-18413 


51.9 


51.4 


163 


6161 


18352-18373 


6256 


18648-18629 


51.3 


50.8 


297 


6162 


18550-18571 


6257 


18702-18684 


50.4 


51.9 


153 


6163 


18720-18738 


6258 


19004-18983 


50.6 


51.0 


285 


6164 


18960-18981 


6259 


19109-19085 


54.7 


54.3 


150 


6165 


19065-19089 


6260 


19217-19195 


52.8 


51.7 


153 


.6166 


19310-19329 


6261 


19476-19454 


50.2 


52.1 


167 


. 6167 


19569-19589 


6262 


19719-19701 


50.5 


51.8 


151 


6168 


19707-19731 




19856-19833 


55.7 


55.9 


. 150 


6169 


19771-19792 






50.1 


50.2 


151 




19833-19851 


6265 


1998&-19966 


50.9 


50.7 


154 



Table 6: Primers 



Forwai 
ID NOl 


rd primer SEQ 
Co-ordinates 


Revers 
ID NOl 


te primer SEQ 
Co-ordinates 


T„ (FOR & REV) (®C) 


Product 
length (bp) 


6266 


20110-20132 


6305 


20425-20404 


51.9, 


50.9 


316 




20468-20492 


6306 


20617-20596 


53.2 


53.5 


150 


6268 


20557-20578 


6307 




50.4 


50.6 


335 


6269 


20838-20856 


6308 


21037-21015 


52.5 


52.0 


200 




21096-21116 


6309 


21295-21272 


50.1 


51.7 


200 


@31 


22173-22194 


6310 


22414-22395 


52.4 


51.0 


242 


6272 


22320-22342 


6311 


22501-22479 


54.8 


54.3 


182 


6273 


22532-22552 


6312 


22695-22675 


50.6 


50.0 


164 




22712-22736 


6313 


22873-22852 


56.7 


55.5 


162 




22842-22861 


6314 


23086-23067 


51.0 


52.8 


245 


6276 


23151-23170 


6315 


23395-23376 


51.4 


50.3 


245 


6277 


23307-23326 


6316 


23524-23501 


51.1 


51.1 


218 


6278 


23615-23635 


6317 


23776-23758 


50.7 


50.2 


162 
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Table 7: Primers 

Name SEQ ID NO: Co-ordinates Name I SEQIDNO: I Co-ordinates 

AB4f 6344 19869-19888 CBIr 6367 28011-28030 

AB51 6345 20238-20257 CB2r 6368 27671-27690 

BCIf 6346 20581-20600 CB3r 6369 27301-27320 

BC2f 6347 20950-20969 CB4r 6370 26931-26950 

BC3f 6348 21339-21358 CB5r 6371 26575-26594 

BC4f 6349 21708-21727 CB6r 63^ 26191-26 210 

BC5f 6350 22041-22060 CB7Y 6373 25841-25860 

BC6f 6351 22410-22429 CB8r 6374 25476-25495 

BC7f I 6352 22759-22778 CB9r 6375 25126-25145 
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BC8f 


6353 


23131-23150 




CB10r 


6376 


24791-24810 


BC9f 


. 6354 


23500-23519 




CBIIr 


6377 


24422-24441 


BClOf 


6355 


23841-23860 




CB12r 


6378 


24031-24050 


BC11f 


6356 


24210-24229 




CB13r 


6379 


23673-23692 


BC12f 


6357 


24560-24579 




CB14r 


6380 


23298-23317 


BC13f 


6358 


24941-24960 




CB15r 


6381 


22928-22947 


BC14f 


6359 


25310-25329 




CB16r 


6382 


22567-22586 


BC15f 


6360 


25675-25694 




CB17r 


6383 


22196-22215 


BC16f 


6361 


26044-26063 




CB18r 


6384 


21831-21850 


BC17f 


6362 


26413-26432 




CB19r 


6385 


21431-21450 




6363 


26763-26782 




CB20r 


6386 


21073-21092 




6364 


27132-27151 




CB21r 


6387 


20715-20734 


BC20f 


6365 


27491-27510 




BAIr 


6388 


20345-20364 


BC21f 


6366 


27845-27864 




BA2r 


6389 


19969-19988 










BA3r 


6390 


19599-19618 










BA4r 


6391 


19228-19247 










BA5r 


6392 


18852-18871 



Table 8: Primers 



Name 


SEQ ID NO 


Co-ordinates 


Name 


SEQ ID NO 


Co-ordinates 


FI 


6393 


1-19 


■ R1 


6441 


334-315 


F2 


6394 


292-310 


R2 


6442 


749-731 


F3 


6395 


721-742 


R3 ' 


6443 


1077-1058 


F4 


6396 


984r1003 


R4 


6444 


1479-1460 


F5 


6397 


1420-1441 


R5 


6445 


1834-1811 


F6 


6398 


1740-1764 


R6 


6446 


2251-2232 


F7 


6399 


2226-2245 


R7 . 


6447 


2749-2728 


F8 


6400 


27^-2763 


: R8 


6448 


3082-3058 


F9 


6401 


3007-3031 


: R9 


6449 


3497-3478 


F10 


6402 


3453-3476 


■ RIO . 


6450 


3877-3853 


F11 


6403 


4007-4027 


R11 


6451 


4316-4295 


FI 2 


6404 


4366-4387 


R12 


6452 


4708-4690 


FI 3 


6405 


4658-4677 


R13 


6453 


5115-5092 


FI 4 


6406 


5239-5260 


R14 


6454 


5560-5542 


FI 5 


6407 


5593-5612 


R15 


6455 


6291-6271 


FI 6 


6408 


6271-6291 


R16 


6456 


7171-7153 


FI 7 


6409 


7253-7272 


R17 


6457 


7677-7654 


FI 8 


6410 


7615-7635 


R18 


6458 


7936-7915 


FI 9 


6411 


7845-7867 


R19 


6459 


8189-8170 


F20 


6412 


8143-8166 


R20 


6460 


8388-8369 


F21 


6413 


8553-8575 


R21 


6461 


9254-9236 


F22 


6414 


9244-9267 


R22 


6462 


9990-9969 


F23 


6415 


10009-10027 


R23 


6463 


10244-10223 


F24 


6416 


10242-10265 


R24 


6464 


10783-10763 


F25 


6417 


10766-10785 


R25 


6465 


11305-11287 


F26 


6418 


11265-11287 


R26 


6466 


11730-11709 


F27 


6419 


11705-11726 


R27 


6467 


11984-11967 


F28 


6420 


12040-12058 


R28 


6468 


12406-12388 


F29 


6421 


12366-12384 


R29 


6469 


12994-12976 


F30 


6422 


12948-12966 


R30 


6470 


13324-13300 


F31 


6423 


13237-13258 


R31 


6471 


13963-13945 
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F32 


6424 


14080-14098 


F33 


6425 


14882-14906 


F34 


6426 


15113-15134 


F35 


6427 


15364-15387 


F36 


6428 


15513-15532 


F37 


6429 


16073-16096 


F38 


6430 


16366-16385 


F39 


6431 


16832-16852 


F40 


6432 


17354-17372 


F41 


6433 


17603-17623 


F42 


6434 


18011-18030 


F43 


6435 


18270-18292 


F44 


6436 


18550-18571 


F45 


6437 


18960-18981 


F46 


6438 


19310-19329 


F47 


6439 


19707-19731 


F48 


6440 


19833-19851 



Table 9: Frimers 





Name 


SEQ ID NO: 


1 


CB12R 


6488 


2 


, R0010 


6489 


3 


R0011 


. 6490 


El 


R0012 


6491 


5 


BNI-ED 


6492 


6 


BNI-EU 


6493 


B 


SAR1S-U 


6494 


8 


SARIAs-D 


6495 


9 


SARIS 


6496 


10 


SARI As 


6497 


11 


IN2-U 


6498 


12 


IN4-D 


6499 


13 


IN-2 


6500 


14 


IN-4 


6501 


15 


IN-6 


6502 


16 


IN-7 


6503 


17 


C0R1-U 


6504 


18 


C0R2-D 


6505 


19 


COR-1 


6506 


20 


COR-2 


6507 


21 


HKUF-U 


6508 


22 


HKUR-D 


6509 


23 


HKU-F 


6510 


24 


HKU-R 


6511 


25 


1451-D 


6512 


26 


1451-U 


6513 


27 


690-D 


6514 


28 


690-U 


6515 


29 


690-D2 


6516 
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R32 


6472 


14561-14540 


R33 


6473 


15145-15124 


R34 


6474 


15383-15363 


R35 


6475 


15605-15585 


R36 


6476 


15999-15978 


R37 


6477 


16404-16380 


R38 


6478 


16777-16758 


R39 


6479 


17359-17340 


R40 


6480 


17573-17552 


R41 


6481 


17883-17862 


R42 


6482 


18225-18205 


R43 


6483 


18648-18629 


R44 


6484 


19004-18983 


R45 


6485 


19217-19195 


R46 


6486 


19719-19701 


R47 


6487 


19921-19901 





Name 


SEQ ID NO:' 


37 


EMC8-D2 


6524 


38 


EMC8-U2 


6525 * 


39 


EMC11-D 


6526 


40 


EMC11-U 


6527 : 


mm 




6528 


mm 




6529 


43 


ORFS-D 


6530 


44 


ORFS-U 


6531 


45 


E7-717F 


6532 


46 


E8-85R 


6533 


47 


E8-307F 


6534 


48 


Ell -771 F 


6535 


49 


E11-96R 


6536 


50 


CON1-F 


6537 , 


51 


CON1-U 


6538 


;BI 


CON2-F 


6539 


BAB 


CON2-R 


6540 


54 


CON3-F 


6541 


55 


CON3-R 


6542 


56 


15-F 


6543 


57 


15-R 


6544 


58 


15-F2 


6545 


59 


15-R2 


6546 


60 


13-F 


6547 


61 


13-R 


6548 


62 


13-F2 


6549 


63 


13-R2 


6550 


wm 


CONTIG-F 


6551 


El 


QT3-R 


6552 



- 393 - 



























































































































































wo 2004/092360 



PCT/DS2004/011710 



30 


690-U2 


6517 


31 


EMC7-D 


6518 


32 


EMC7-U 


6519 


33 


EMC7-D2 




34 


EMC7-U2 




35 


EMC8-D 


6522 


36 


EMC8-U 


6523 



66 


QT3-F 


6553 


67 


QIN-R 


6554 


68 


QIN-F 


6555 


69 


AB1-F 


6556 


70 


AB2-F 


6557 


71 


AB3-F 


6558 


72 


AB1-R 


6559 


































wo 2004/092360 



PCT/US2004/011710 



Table 10; Features of the predicted proteins and open reading frames of the SAKS yirus 



SARS ORF Length Role 

(SEQ ID NO) (aa) 



179 Leader protein 



P28 (9766) 



P65{9767) 



Cleavage 

sfte 




Nsp7 (9774) 



Nsp9 (9775) 



NspIO (9776) 



Nspll (9777) 



Nsp12 (9778) 



Nsp13 (9779) 



Spike (S) 
(6042) 



Orf3(6043) 



Orf4 (6044) 



Envelope (E) 
(6045) 



Matrix (M) 
(6046) 



(6047) 



Orf8 (6048) 



Orf9 (6049) 



OrflO 



Orfi 1(6050) 



NudeocapsM (N) 
(6052) 



Orfi 3 







Homologue of MHV p65 
cleavage product 



Papain like protease, 
cleaves the first two 
proteins 




818 (G/A) 



3240 (Q/S) 



phosphoesterase domain 
Zn binding domain 




3C4ike protease, cleaves 3546 (Q/G) 
proteins nsp1-nsp12 



3836 (Q/S) 






5TMDs 





RNA polymerase 



Putative helicase 
Tanner dd poos) JBsrCtez}27ft3Kr&62 



3919 (Q/A) 



4117 (Q/N) 



4230 (Q/A) 



4369 (Q/S) I Putative growth factor-like motif 



5298 (Q/A) 



5899 (Q/A) Metal binding domain, 
ATP/GTP binding domain 





Major antigenic 
determinant, contains the 
receptor-binding domain 





Associated with viral 
envelope 



Assodated with viral 
envelope, membrane 
spanning protein 




Leader peptide, 1 TMD, 17 N- 
glycosylation sites 



2 TMDs, 1 N-glycosytation site, 40 
O-gtycosytation sit^ 



1 TMD, 2 N-^ycosylation sites .• 



3 TMDs, 1 N-glycosyiation site 







Associaled wim vifBl geriomic RNA 



Surface-associated 



Surface-associated 



1 N-g)ycosylation site 



phosphoprotein 



1 Og^ycosylation site 



TMD: predicted transmembrane domain. 

Cons *: + indicates presence of corresponding protein at least in one of the other coronaviruses 
#: Alternatively, cleaved after Gly-Gly (Le. at G/A) to give a ISOmer 

##: This 2422mer may be further cleaved after residue 1922 (Gly-2740 of SEQ ID NO: 6039) to 
give a 1922mer Pl^ro containing the Zii-binding motif (SEQ ID NO: 7254) and a SOOmer. 



- 395 - 




























































































wo 2004/092360 



PCT/US2004/011710 



Table 11: Protdn homologies between SARS and other coronaviruses 

Numbers indicate percentage of aminoacid identity between SARS proteins and corresponding 
gene products of other coronaviruses. More conserved pairs are in bold; more variable pairs are 
underlined. 





group 1 


group 2 


group 3 


Proteins 


229E 


TGV 


PEDV 


MHV 


BCoV 


AIBV 


REPUCASE REGION 




leader protein p28 


<20 


<20 


<20 


27 


<20 


<20 


p65 homologue 


<20 


23 


23 


<20 


20 


<20 


nspi (PLP protease) 


25.5 


25.8 


25.4 


29 


30 


25 


nsp2 (3CL protease) 


40.4 


43.8 


44.6 


50 


48.4 


41 


nsp3‘ 


30 


2Z 


29.4 


34.2 


35.5 


28.5 


nsp4 


38.6 . 


42.2 


39.8 


47.5 


46.1 


37.3 


nsp5 


48.2 


42.9 


43.9 


46.8 


47.3 


38.7 


nsp6 


45.1 


38.9 


45.1 


45.1 


46.9 


39.8 


nsp7 


53.8 


54.5 


56.1 


56.2 


55.4 


58.3 


nsp9 (polymerase) 


59.8 


59.6 


60 


67.3 


66,9 


62.4 


nsp10 (helical) 


60.7 


62 


62.3 


67.2 


68.6 


58.9 


nsp11 


52.3 


53.7 


52.3 


57.6 


57.6 


52 


nspi 2 


43.1 


43 


45.4 


45.9 


45 


40.2 


nsp13 


56.4 


54.4 


55,3 


63 


65 


53.4 


STRUCTURAL REGION 


Spike (S) 


28.8 


31.6* 


30.3 


31.1 


31 


32.r 


Envelope (E) 


33* 


27.9 


20 


23 


26.5 


23.2 


Matrix glycoprotein (M) 


30.6 


3^5 


34.8 


40.8 


41.9 


32.5 


Nucleocapsid (N) 


26.9 


30.1 


29.5 


37.3 


37A 


31.5 



* These three alignments were obtained only on a fingment of the whole protein. 
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Table 12 : Nucleotide and aminoadd differences between five SASS isolates 







FRA* 


TOR2* 


Urbani* 


CUHK* 


HKU* 


position® 


base/aminoacid 


base/aminoactd 


base/amlnoackl 


base/aminoacid 


base/aminoacid 
















ORFla 


2557 


A/Thr 


G/Ala 


G/Ala 




G/Ala 


2601 










c 


7746 


G/Pro 






T 




7919 


C/Ala 




T/Val 






7930 


G/Asp 








A/Asn 


KfiHSi 


G/Ser 








C/Thr 


MSM 


G/Arg 








C/Thr 


9404 


T/Val 










9479 


T/Val 






C/Ala 




11448 


T/lle 


C 


C 


C 


c 
















ORFlb 


13494 


GT/Vai 








AG/Ser 


16622 


C/Ala 




T 






17564 


T/Asp 






C/Glu 




17846 


C/Arg 






T 




18065 


G/Lys 








A 


18965 


A/lle 


T 


T 


T 


T 


19064 






V G 


G 




19084 


T/lle 


Cmir 


C/7hr 


C/TTir 


C/Thr 


spike 




G^GIy 






A/Asp 






T/lle 






C/Thr 




23220 


T/Ser 


G/Ala 








24872 


T/Leu 




C 






24933 


T/Phe 


C/Leu 


C/Leu 


C/Leu 


C/Leu 


ORF3 


25298 


G/Gly 


A/Arg 










25569 


T/Met 








A/Lys 
















nrmtrix 


26600 


T/VaJ 


C/Ala 


G/Ala 


C//Va 






26857 


T/Ser 




C/Pro 






ORF10 


27827 


T/Cys 






C/Arg 




nucleocapsid 


28268 




C/Thr 


C/Thr 


C/Thr 


C/Thr 



* SARS coFonaviius FRA (accession number AYS 10120) 

SARS coionavirus TOR2 (accession.number AY274119) 

5 SARS coionavirus Uibani (accession number AY278741) 

SARS coronaviius CUHK-Wl (accession number AY278554) 
SARS coronaviius HKU-39849 (accession numba: AY278491) 

® The position is based on the ERA sequence. 

0 
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TABLES 13-25: T-epitope predictions for SEQ ID NOS: 6039-6050 & 6052 

Epitope predictions were performed at ht^:/Avww.nq>iib-beTiiriTnpg.de/MAPPP/bmdmg.html using a 
mihimum score of 0.5 and the BIMAS matrix, with a maximum of 20 results being selected. The 
analysis revealed 9mer and lOmer epitopes. 

Table 13: Epitopes for SEQ ID NO: 6039 



HLA A1 - 9 mers 



using this molecule 



Maximum possible score 



5625 



Score 



Rank 



Start position 



Sequence 



<Vb of max. score 



8 % 



1867 



4139 



SEO ID NO: 



SEQ ID NO: 



7400 



7401 



450 



312.5 



5.55 % 



4% 



88 



4249 



SEQ ID NO: 



SEO ID NO: 



7402 



7403 



3.55 % 



2.22 % 



4059 



2027 



SEO ID NO: 



SEQ ID NO: 



7404 



7405 



1.6 % 



1.11 % 



3413 



1823 



SEO ID NO: 



SEO ID NO: 



7406 



7407 



62.5 



50 



0.88 % 



0.88 % 



2798 



220 



SEO ID NO: 



SEQ ID NO: 



7408 



7409 



0.8 % 



0.8 % 



3738 



4182 



SEQ ID NO: 



SEQ ID NO: 



7410 



7411 



0.8 % 



0.66 % 



4174 



1940 



SEQ ID NO: 



SEQ ID NO: 



7412 



7413 



37.5 



31.25 



0.55 % 



0.48 % 



38 



1231 



SEO ID NO: 



SEQ ID NO: 



7414 



7415 



0.44 % 



0.44 % 



1613 



3645 



SEO ID NO: 



SEQ ID NO: 



7416 



7417 



0.44 % 



0.44 % 



0.4 % 



4192 



378 



SEQ ID NO: 



SEQ ID NO: 



7418 



7419 



25 



22.5 



HLA A1 - 10 mers 



Maximum possible score using this molecule 



5625 



Score 



Rank H Start position 



Sequence 



SEQ ID NO: 7420 



<*/o of max. score 



8 % 



1867 



1495 



SEO ID NO: 7421 



SEO ID NO: 7422 



SEO ID NO: 7423 



4% 



2.4 % 



3921 



486 



4139 



2.22 % 



2.22 % 
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62 



1190 



SEO ID NO: 



SEQ ID NO: 



7425 



7426 



1.6 % 



1.6 % 



1284 



3284 



SEO ID NO: 



SEQ ID NO: 



7427 



7428 



1.6 % 



1.6 % 



2921 



349 



SEO ID NO: 



SEQ ID NO: 



7429 



7430 



1.2 % 



0.8 % 



67.5 



45 



789 



1185 



SEQ ID NO: 



SEQ ID NO: 



7431 



7432 



0.8 % 



0.8 % 



4184 



1313 



SEQ ID NO: 



SEQ ID NO: 



7433 



7434 



0.8 % 



0.64 % 



3948 



149 



SEQ ID NO: 



SEQ ID NO: 



7435 



7436 



0.48 % 



0.44 % 



941 



1390 



1613 



SEO ID NO: 



SEQ ID NO: 



SEO ID NO: 



7437 



7438 



7439 



0.44 % 



0.44 % 



0.44 % 



HLA A3 - 9 mers 



using this molecule 



Maximum possible score 



12150 



Score 



Rank 



Start position 



1010 



Sequence 



% of max. score 



1 . 48 % 



SEO ID NO: 



SEQ ID NO: 



7440 



7441 



3155 



1229 



1.48 % 



1.23 % 



SEQ ID NO: 



SEQ ID NO: 



7442 



7443 



2405 



2 



0.88 % 



0.74 % 



SEQ ID NO: 



SEQ ID NO: 



7444 



7445 



2304 



2358 



0.74 % 



0.74 % 



SEQ ID NO: 



SEO ID NO: 



7446 



7447 



3160 



3771 



0.74 % 



0.74 % 



SEQ ID NO: 



SEO ID NO: 



7448 



7449 



4007 



3079 ■ 



0.74 % 



0.66 % 



SEQ ID NO: 



SEQ ID NO: 



7450 



7451 



4045 



1081 



0.66 % 



0.49 % 



3268 



4144 



SEQ ID NO: 



SEQ ID NO: 



7453 



7454 



0.49 % 



0.49 % 



SEO ID NO: 



SEQ ID NO: 



7455 



7456 



0.37 % 



0.37 % 



1537 



313 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



7457 



7458 



7459 



0.37 % 



0.32 % 



0.32 % 




1744 
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Maximum possible score 



HLA A3 - 10 mers 



using this molecule 



Rank 



Start position 



62 



2151 



633 



1158 



2565 



2298 



3159 



640 



2186 



3869 



2308 



1080 



2358 



3955 



714 



1081 



1170 



1228 



Sequence 



SEO ID NO: 



SEO ID NO: 



7460 



7461 



SEQ ID NO: 



SEQ ID NO: 



7462 



7463 



SEQ ID NO: 



SEQ ID NO: 



7464 



7465 



SEQ ID NO: 



SEQ ID NQ: 



7466 



7467 



SEQ ID NO: 



SEQ ID NO: 



7468 



7469 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



7470 



7471 



7472 



SEQ ID NO: 7474 



SEQ ID NO: 7475 



SEQ ID NO: 7476 



SEQ ID NO: 7477 



SEQ ID NO: 7478 



SEQ ID NO: 7479 



12150 



Vo of max. score R Score 



4.44% I .540 



2.46 % 



2.22 % 



2.22 % 



2.22 % 



1.77 % 



1.11 % 



0.98 % 



0.74 % 



0.74 % 



0.66 % 



0.55 % 



0.49 % 



0.49 % 



0.49 % 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



HLA A24 - 9 mers 



Maximum possible score using this molecule 



3797 



4202 



3189 



1864 



1066 



2143 



2693 



1426 



1238 



3768 



797 



SEQ ID NO: 7480 



SEQ ID NO: 7481 



SEQ ID NO: 



SEQ ID NO: 



7484 



7485 



SEQ ID NO: 



SEQ ID NO: 



7486 



7487 



SEO ID NO: 



SEO ID NO: 



SEO ID NO: 



7488 



7489 



7490 



Vo of max. score 



37.57 % 



37.57 % 



25.05 % 



23.14 % 



22.54 % 



22.54 % 



22.54 % 



18.78 % 



18.03 % 



18.03 % 



15.03 % 



67.5 



60 



1596.672 



Score 



600 



369.6 



360 




- 400 - 







































































































































1882 



1490 



2237 



95 



1821 



2289 



3080 



3660 



4354 



SEO ID NO: 7491 



SEO ID NO: 7492 



SEO ID NO: 7493 



SEQ ID NO: 7494 



SEQ ID NO: 7495 



SEQ ID NO: 7496 



SEQ ID NO: 7497 



SEQ ID NO: 7498 



SEQ ID NO: 7499 



15.03 % 



13.77 % 



13.77 % 



12.52 % 



12.52 % 



12.52 % 



12.52 % 



12.52 % 



12.52 % 



HUV A24 - 10 mers 



Maximum possible score using this molecule 



Rank II Start position 



2143 



2 B 1159 



3 



1150 



5 II 2763 



3165 



SEQ ID NO: 7500 



SEQ ID NO: 7501 



SEQ ID NO: 7502 



SEQ ID NO: 7504 




SEQ ID NO: 7516 



SEQ ID NO: 7517 



SEQ ID NO: 7518 



SEQ ID NO: 7519 



% of max. score 



37.87% 



26.30 % 



26.30 % 



18.78 % 



18.78 % 



■ 18.78% 



15.03 % 



15.03 % 



15.03 % 



13.77 % 



12.52 % 



12.52 % 



12.52 % 



12.52 % 



12.52 % 



12.52 % 



11.27 % 



9.39 % 



9.39 % 



9.39 % 



■ HLA A 0201 - 9 mers 



Maximum possible score using this molecule 



Rank 




1596.672 



Score 



604.8 



420 



420 



300 



300 



300 



240 



240 



240 



220 



200 



200 



200 



200 



200 



200 



180 



150 



150 



150 



Start position 



2308 



3729 



Sequence 



SEQ ID NO: 7520 



SEQ ID NO: 7521 



% of max. score 



0.20 % 



0.10 % 



3925227.1 



Score 



8144.13515256 



4047.23088 
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3574 



3615 



3159 



2339 



2201 



3559 



3085 



4070 



3708 



3098 



1362 



3563 



3774 



4242 



2340 



65 



3862 



2860 



SEQ ID NO: 7524 



SEQ ID NO: 7525 



SEQ ID NO: 7526 



SEQ ID NO: 7527 



SEQ ID NO: 7528 



SEQ ID NO: 7530 



SEQ ID NO: 7532 



SEQ ID NO: 7533 



SEQ ID NO: 7535 



SEQ ID NO: 7536 



SEQ ID NO: 7537 



SEQ ID NO: 7538 



SEQ ID NO: 7539 



0.09 % 



0.06 % 



0.05 % 



0.03 % 



0.03 % 



0.02 % 



0.02 % 



0.02 % 



0.02 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



3547.4996634 



2722.682592 



1999.734264 



1551.92907744 



1521.53694 



1174.38939504 



1146.296448 



970.4103696 



958.92888 



942.678 



900.6984 



735.86016 



687.655656 



685.78272 . 



668.37342936 



640.1983392 



620.57772 



607.88448. 



HLA A 0201 - 10 mers 



Maximum possible score using this moiecuie 



Rank 







2307 



2201 



1772 



3087 



2339 



2308 



3061 



2748 



3837 



59 



2877 



4114 















SEQ ID NO: 7540 






SEQ ID NO: 7542 



SEQ ID NO: 7543 






SEQ ID NO: 7546 



SEQ ID NO: 7547 



SEQ ID NO: 7548 



SEQ ID NO: 7549 



SEO ID NO: 7550 






SEQ ID NO: 7552 



% of max. score 



0.40 % 



0.12 % 



0.05 % 



0.04 % 



0.03 % 



0.02 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



3925227.1^ 



Score 



15915.66281448 



4772.09313 



2295.04855632 



1759.6656 



1215.76896 



1116.29986272 



970.14776112 



836.2525104 



726.706344 



720.8292 



650.3112 



620.22996 



559.8936 



484.4565072 



437.48208 



319.9392 



305.94186 
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1223 



SEQ ID NO: 7557 



0.00 % 



289.08792 



852 



2139 



SEQ ID NO: 7558 



SEQ ID NO: 7559 



0.00 % 



0.00 % 



285.67242 



284.845869 



HLA A 1101 - 9 mers 



using this molecule 



Sequence H *V6 



Maximum possible score 



Rank H Start position 



of max. score 



50 % 



36 



Score 



4200 



281 



SEQ ID NO: 



SEQ ID NO: 



7560 



7561 



25 % 



25 % 



16.66 % 



3236 



509 



SEQ ID NO: 



SEQ ID NO: 



7562 



7563 



848 



2193 



SEQ ID NO: 



SEQ ID NO: 



7564 



7565 



16.66 % 



16.66 % 



3542 



541 



SEQ ID NO: 



SEQ ID NO: 



7566 



7567 



16.66 % 



15% 



1748 



829 



SEQ ID NO: 



SEQ ID NO: 



7568 



7569 



12.5 % 



11.11 % 



1149 



2027 



SEQ ID NO: 



SEQ ID NO: 



7570 



7571 



11.11 % 



11.11 % 



2576 



873 



SEQ ID NO: 



SEQ ID NO: 



7572 



7573 



11.11 % 



8.33% 



2725 



3541 



SEQ ID NO: 



SEQ ID NO: 



7574 



7575 



8.33 % 



8.33 % 



1837 



2475 



SEQ ID NO: 



SEQ ID NO: 



7576 



7577 



7.5 % 



7.5 % 



2703 



1823 



SEQ ID NO: 



SEQ ID NO: 



7578 



7579 



7.5 % 



6.66 % 



HLA A 1101 - 10 mers 



Maximum possible score using this moiecule 



Rank 



Start position 



Sequence 



% of max. score 



Score 



18 



3541 



281 



SEQ ID NO: 



SEQ ID NO: 



7580 



7581 



50 % 



25 % 



1495 



2303 



SEQ ID NO: 



SEQ ID NO: 



7582 



7583 



25 % 



25 % 



2616 



48 



SEQ ID NO: 



SEQ ID NO: 



7584 



7585 



25 % 



16.66 % 



1394 



1499 



SEQ ID NO: 



SEQ ID NO: 



7586 



7587 



16.66 % 



16.66 % 
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Rank 



Start position 



1703 



Seauence 



°/o of max. score 



3.70% 



Score 



17 



3008 



SEQ ID NO: 



SEO ID NO: 



7620 



7621 



2.22 % 



2.22 % 



SEO ID NO; 



SEO ID NO; 



7622 



7623 



4106 



3450 



2.22 % 



1.66 % 



SEO ID NO; 



SEO ID NO; 



7624 



7625 



1.48 % 



1.48 % 



SEQ ID NO; 



SEO ID NO: 



7626 



7627 



1.48 % 



1.48 % 



SEQ ID NO: 



SEO ID NO: 



7628 



7629 



1000 



1072 



1.48 % 



1.48 % 



SEQ ID NO: 



SEQ ID NO: 



7630 



7631 



1404 



1980 



1.48 % 



1.48 % 



SEO ID NO: 



SEQ ID NO: 



7632 



7633 



2262 



2543 



SEQ ID NO: 



SEO ID NO: 



7634 



7635 



1.48 % 



1.48 % 



2906 



3077 



3175 



4195 



4251 



SEQ ID NO: 



SEO ID NO: 



7636 



7637 



1.48 % 



1.48 % 



1.48 % 



1.48 % 



1.48 % 





SEO ID NO: 



SEQ ID NO: 



7638 



7639 



Table 14: Epitopes for SEQ ID NO: 6040 



HLA A1 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 7640 



SEQ ID NO: 7641 



SEQ ID NO: 7642 



SEQ ID NO: 7643 



SEQ ID NO: 7644 



% of max. score 



0.04 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



HLA A1 - 10 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



20 



Sequence 



SEO ID NO: 7645 



SEQ ID NO: 7646 



SEQ ID NO: 7647 



% of max. score 



0.01 % 



0.00 % 



0.00 % 



5625 



Score 



2.25 



0.75 



5625 



Score 



0.9 



- 405 - 
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HLA A3 - 9 mers 



Maximum possible score usinq this molecule 



Rank 





Start position 



115 



87 



80 



125 



39 



56 



135 



91 



119 



141 



150 



137 



Sequence 



SEO ID NO: 7649 



SEO ID NO: 7650 



SEQ ID NO: 7651 



SEQ ID NO: 7652 



SEQ ID NO: 7653 



SEQ ID NO: 7654 



SEQ ID NO: 7655 



SEQ ID NO: 7656 



SEQ ID NO: 7657 



SEO ID NO: 7658 



SEQ ID NO: 7659 



SEQ ID NO: 7660 




% of max. score 



0.24 % 



0.04 % 



0.03 % 



' 0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0 . 00 % 



0.00 % 



HLA A3 - 10 mers 



Maximum possible score usinq this molecule 



Rank 





Start position 



36 



144 



101 



99 



80 



125 



71 



118 



40 



5 



56 



107 



135 



141 



148 



116 



Sequence 



SEQ ID NO: 7661 



SEQ ID NO: 7662 



SEQ ID NO: 7663 



SEQ ID NO: 7664 



SEQ ID NO: 7665 



SEQ ID NO: 7666 



SEQ ID NO: 7667 



SEQ ID NO: 7668 



SEQ ID NO: 7669 



SEQ ID NO: 7670 



SEQ ID NO: 7671 



SEQ ID NO: 7672 



SEQ ID NO: 7673 



SEO ID NO: 7674 



SEQ ID NO: 7675 



SEQ ID NO: 7676 



% of max. score 



0.24 % 



0.06 % 



0.03 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



12150 



Score 



30 



6 



4.05 



1.8 



1.5 




0.9 



0.6 



0.54 



12150 



Score 



30 



8 



1.6875 



1.35 



0.9 



0.9 



0.6 



0.6 



0.6 



0.6 



0.5 



HLA A24 - 9 mers 



■ 406 - 


























































































































wo 2004/092360 



PCT/US2004/011710 



Maximum possible score usinq this molecule 



1596.672 



Score 



Rank 



Start position 



153 



Vo of max. score 






1.05 % 



0.75 % 



16.8 



12 



0.50 % 



0.50 % 



SEQ ID 



7680 



0.45 % 



0.45 % 



0.45 % 



0.41 % 



SEQ ID 



7684 



0.37 % 



0.37 % 



0.37 %. 



0.37 % 



SEQ ID 



7688 



0.30 % 



0.30 % 



SEQ ID 



7690 



0.30 % 



0.30 % 



0.27 % 



0.25 % 



0.25 % 



0.25 % 



HLA A24 - 10 mers 



Maximum possible score using this molecule 






Rank 



Start position 



of max. score 



12.52 % 



Score 



200 



SEQ ID NO: 7698 



0.75 % 



0.50 % 



SEQ ID NO: 7700 



SEQ ID NO: 7701 



SEQ ID NO: 7702 



0.45 % 



0.45 % 



0.37 % 



0.30 % 



SEQ ID NO: 7704 



SEQ ID NO: 7705 



0.30 % 



0.30 % 



SEQ ID NO: 7706 



SEQ ID NO: 7707 



0.30 % 



0.30 % 



SEO ID NO: 7708 



SEO ID NO: 7709 



0.27 % 



0.27 % 



-4Cn- 




































































































































SEO ID NO: 7710 



SEQ ID NO: 7711 



SEO ID NO: 7712 



0.25 % 



0.25 % 



0.25 % 



0.22 % 



0.18 % 



0.18 % 



0.12 % 



HLA A 0201 - 9 mers 



Maximum possible score usinq this molecule 



Rank 






Sequence 




SEQ ID NO: 7719 



SEQ ID NO: 7720 



SEQ ID NO: 7723 









SEO ID NO: 7730 



% of max. score 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



171.96732 



11.6146182 



11.304684 



10.34586 



6.56830734 



6.07614 



5.981472 



5.194044 



4.299183 



4.299183 



4.138344 



3.42792 



3.383484 



3.381 



3.251556 



2.9542005 



1.982232 



1.86921 



1.76402985 



HLA A 0201 - 10 mers 



Maximum possible score using this molecule 



Rank 






% of max. score 



SEQ ID NO: 7737 



0.00 % 



0.00 % 



SEQ ID NO: 7739 



SEQ ID NO: 7740 



0.00 % 



0.00 % 



3925227.1 



Score 






69.552 



36.5148 



21.3624 




^ 8 - 

































































































































wo 2004/092360 
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SEQ ID NO: 7741 



SEQ ID NO: 7744 



SEQ ID NO: 7750 



SEQ ID NO: 7754 



SEQ ID NO: 7756 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



17.73576 



13.9104 



11.304684 



8.230458 



7.3086111 



6.174104475 



6.0858 



5.6823984 



3.188283 






2.076624 



2.005692 



0.00 % 



0.00 % 



1.798209 



1.68996456 



1.516482 



1.2683304 



HLA A 1101 - 9 mers 



Maximum possible score usinq this molecule 






Rank I Start position 



Sequence 



SEQ ID NO: 7757 



'Vo of max. score 



2.77 % 



36 



Score 



HLA A 1101 " 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 






Sequence 



SEQ ID NO: 7758 



SEQ ID NO: 7759 



SEQ ID NO: 7760 



Vo of max. score 



33.33 % 



2.77 % 



1.66 % 



36 



Score 



HLA B7 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 7761 



SEQ ID NO: 7762 



SEQ ID NO: 7763 



SEQ ID NO: 7764 



SEO ID NO: 7765 



Vo of max. score 



2.22 % 



1.11 % 



0.66 % 



0.37 % 



0.37 % 



SEQ ID NO: 7766 



SEQ ID NO: 7767 



0.37 % 



0.22 % 



5400 



Score 



- 409 - 































































































































wo 2004/092360 



PCT/DS2004/011710 



SEQ ID NO: 



SEQ ID NO: 



7768 



7769 



SEQ ID NO: 



SEQ ID NO: 



7770 



7771 



0.22 % 



0.22 % 



SEQ ID NO: 



SEQ ID NO: 



7772 



7773 



0.14 % 



0.09 % 



0.09 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



7774 



7775 



SEO ID NO: 



SEQ ID NO: 



7776 



7777 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



7778 



7779 



7780 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



HLA B7 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



99 



Sequence 



SEO ID NO: 7781 



SEQ ID NO: 7782 



SEQ ID NO: 7783 



SEQ ID NO: 7784 



SEQ ID NO: 



SEO ID NO: 



7786 



7787 



SEO ID NO: 



SEQ ID NO: 



7788 



7789 



SEO ID NO; 



SEQ ID NO: 



7790 



7791 



SEQ ID NO: 



SEQ ID NO: 



7792 



7793 



% of max. score 



0.74 % 



0:37 % 



0.22 % 



0.22 % 



0.22 % 



0.11 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



7794 



7795 



SEO ID NO: 



SEO ID NO: 



7796 



7797 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



7798 



7799 



7800 



0.07 % 



0.07 % 



0.07 % 



0.05 % 



0.05 % 



0.03 % 



0.03 % 



0.03 % 



0.02 % 



5400 



Score 




^ 10 - 































































































































wo 2004/092360 



PCT/US2004/011710 



Table 15: Epitopes for SEQ ID NO: 6041 



Maximum possible score 



Rank II Start position 



HLA A1 - 9 mers 



usinq this molecule 



5625 



Score 



Sequence 



% of max. score 



1.6 % 



1.33 % 



1818 



373 



SEO ID NO: 



SEQ ID NO: 



7801 



7802 



SEO ID NO: 



SEQ ID NO: 



7803 



7804 



1.33 % 



0.88 % 




SEO ID NO: 



SEQ ID NO: 



7805 



7806 



0.88 % 



0.88 % 



SEO ID NO: 



SEQ ID NO: 



7807 



7808 



0.8 % 



0.8 % 



1271 



1799 



SEO ID NO: 



SEQ ID NO: 



7809 



7810 



0.8 % 



0.8 % 




1393 



386 



SEQ ID NO: 



SEQ ID NO: 



7811 



7812 



0.48 % 



0.44 % 



2304 



198 



840 



2359 



1194 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



7813 



7814 



0.44 % 



0.4 % 



7815 



78.16 



0.4 % 



0.4 % 



22.5 



SEQ ID NO: 



SEO ID NO: 



7817 



7818 



0.32 % 



0.32 % 



22.5 



18 



SEQ ID NO: 



SEO ID NO: 



SEQ ID NO: 



18 



12.5 



11.25 



2200 



996 



7819 



7820 



0.22 % 



0.2 % 



HLA A1 - 10 mers 



Maximum possible score using this molecule 



5625 



Score 



Rank 



Start position 



995 



1303 



1582 



1456 



Sequence 



SEQ ID NO: 7821 



SEQ ID NO: 



SEQ ID NO: 



7822 



7823 



% of max. score 



10 % 



2.22 % 



2% 



1.6 % 



562.5 



125 



112.5 



90 




SEQ ID NO: 



SEQ ID NO: 



7824 



7825 



632 



2281 



SEQ ID NO: 



SEO ID NO: 



7826 



7827 



1.11 % 



0.88 % 



0.88 % 



0.88 % 



62.5 



50 



SEQ ID NO: 



SEQ ID NO: 



7828 



7829 



1586 



2109 



0.8 % 



0.8 % 



0.55 % 



45 



45 



31.25 



SEQ ID NO: 



SEQ ID NO: 



7830 



7831 



- 411 - 
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1916 



966 



1387 



2263 



2457 



1057 



2562 



74 



298 



SEQID NO: 7832 



SEQ ID NO: 7833 



SEQID NO: 7834 



SEQ ID NO: 7835 



SEQID NO: 7836 



SEQID NO: 7837 



SEQ ID NO: 7838 



SEQ ID NO: 7839 



SEQ ID NO: 7840 



0.53 % 



0.44 % 



0.44 % 



0.44 % 



0.26 % 



0.22 % 



0.22 % 



0.17 % 



0.17 % 



HLA A3 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



536 




1049 



2200 



2598 



Sequence 



SEQ ID NO: 7841 



SEQ ID NO: 7842 



SEQ ID NO: 7843 



SEQ ID NO; 7844 



SEQ ID NO: 7845 



SEQ ID NO: 7846 



SEQ ID NO: 7847 



SEQ ID NO: 7848 



SEQ ID NO: 7849 



SEQ ID NO: 7850 



SEQ ID NO: 7851 



SEQ ID NO: 7852 



SEQ ID NO: 7853 



SEQ ID NO: 7854 



SEQ ID NO: 7855 



SEQ ID NO: 7856 



SEQ ID NO: 7857 



SEQ ID NO: 7858 



SEQ ID NO: 7859 



SEQ ID NO: 7860 



°/o of max. score 



3.33 % . 



2.46 % ; 



1.64 % .. 



1.48 % 



0.55 % 



0.49 % 



0.44 % 



0.44% 



0.37 % . 



0.32 % 



0.32 % 



0.32 % 



0.29 % 



0.24 % 



0.24 % 



0.24 % 



0.24 % 



0.22 % 



0.22 % 



0.22 % 



HLA A3 - 10 mers 



Maximum possible score using this molecule 



Rank I I Sequence 



SEQ ID NO: 7861 



SEQ ID NO: 7862 



30 



25 



25 



25 



15 



12.5 



12.5 



10 



10 




% of max. score 



2.46 % 



1.48 % 



12150 



Score 



405 



300 



200 



180 






12150 



Score 



300 



180 





























































































590 



697 



919 



1354 



SEQ ID NO: 



SEQ ID NO: 



SEO ID NO: 



7863 



7864 



7865 



1.11 % 



1.11 % 



0.74 % 



1430 



2534 



SEQ ID NO: 



SEO ID NO: 



7866 



7867 



0.74 % 



0.74 % 



SEO ID NO: 



SEQ ID NO: 



7868 



7869 



0.74 % 



0.49 % 



922 



1735 



SEQ ID NO: 



SEO ID NO: 



7870 



7871 



0.49 % 



0.49 % 



2281 



1894 



SEQ ID NO: 



SEO ID NO: 



7872 



7873 



0.49 % 



0.49 % 



2552 



555 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



7874 



7875 



0.44 % 



0.44 % 



1134 



1149 



SEO ID NO: 



SEQ ID NO: 



7876 



7877 



0.37 % 



0.37 % 



SEQ ID NO: 



SEQ ID NO: 



7878 



7879 



7880 



0.29 % V 



0.24 % ■ 



0.24% 



Maximum possible score 



Rank 



2375 



HLA A24 - 9 mers 



usinq this moiecule 



1751 



195 



2306 



806 



1252 



160 



1275 



2175 



2207 



2343 



1825 



1690 



SEO ID NO: 7881 



SEQ ID NO: 7882 



SEQ ID NO: 7883 



of max. score 



36.07 % 



28.93 % 



25.05 % 



21.04 % 



20.66 % 



SEO ID NO: 7886 



18.78 % 



15.03 % 



SEQ ID NO: 7888 



SEQ ID NO: 7889 



15.03 % 



12.52 % 



SEQ ID NO: 7890 



SEQ ID NO: 7891 



12.52 % 



12.52 % 



12.52 % 



12.52 % 



SEQ ID NO: 7894 



11.27 % 



7.51 % 



6.88 % 



4.69 % 




1596.672 



Score 







- 413 - 












































































































































wo 2004/092360 
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1949 



SEO ID NO: 7898 



SEO ID NO: 7899 



SEO ID NO: 7900 



3.75 % 



3.75 % 



3.38 % 



Maximum possible score 



HLA 



using this 



A24 - 10 mers 



molecule 



Rank 



Start position H S 



641 II SEQID 



of max. score 



45.09 % 



24.80 % 



1596.672 



Score 



809 



9 



SEQ ID 



SEO ID 



SEO ID 



NO: 7901 



NO: 7902 



NO: 7903 



NO: 7904 



SEO ID 



SEO ID 



NO: 7905 



NO: 7906 



22.54 % 



18.03 % 



15.03 % 




15.03 % 



15.03 % 



1436 



2219 



SEO ID 



SEO ID 



NO: 7907 



NO: 7908 



SEO ID 



SEO ID 



SEO ID 



NO: 7909 



NO: 7910 



NO: 7911 



15.03 % 



15.03 % 



1065 



1953 



13.77 % 



13.15 % 



1966 



2600 



12.52 % 



12.52 % 




SEO ID NO: 7914 



SEO ID NO: 7915 



9.39 % 



9.39 % 



1989 



342 



9.39 % 



8.76 % 



1071 



2570 



2550 



SEO ID NO: 7918 



8.76 % 



6.88 % 



6.26 % 



SEQ ID NO: 7920 



HLA A 0201 - 9 mers 



Maximum possible score using this molecule 



Rank 






Sequence 



% of max. score 



3925227.1 



Score 



1632 



1640 



0.09 % 



0.04 % 



3607.31448 



1748.2560912 



1492.58592 



1434.16845 



1776 



2512 



SEQ ID NO: 7923 



SEQ ID NO: 7924 



0.03 % 



0.03 % 



1073 



230 




1001 



716 



SEQ ID NO: 7927 



0.03 % 



0.01 % 



0.01 % 



0.01 % 



1338.876 



685.78272 



559.8936 



558.27486 
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2280 



590 



SEO ID NO: 7929 



SEQ ID NO: 7930 



664 



1094 






1735 



1625 



1974 



2382 



2417 



744 



SEQ ID NO: 7940 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



511.19781048 



469.6692 



442.076389524 



382.536 



382.536 



342.4606344 



336.885048 



319.9392 



319.9392 



256.416670125 



232.52724 



228.0411084 



Maximum possible 



HLA A 0201 - 10 mers 



score using this molecule 



Rank 







2511 



1608 



SEO ID NO: 



SEQ ID NO: 



7941 



7942 



1171 



1691 



20 



1632 



2280 



1963 






SEO ID NO: 7944 



SEO ID NO: 7946 



1955 



741 



523 



1073 



2489 



777 



1737 



589 



229 



SEQ ID NO: 7960 



9/6 of max. score 



0.38 % 



0.05 % 



0.04 % 



0.03 % 



0.03 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



15126.90795 



2049.4656 



1879.5921264 



1566.6522795 



1338.876 



1107.960876 






549.9372312 



479.041993296 



472.418344576987 



358.73928 



331.093464 



318.652488 



278.7876 



266.6988828 



243.432 



218.5730664 



218.0785572 



210.538251 



205.230564 



HLA A 1101 - 9 mers 



Maximum possible score using this molecule 




^ 15 - 
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Rank 



Start position 



2337 



Sequence 



SEO ID NO: 7961 



% of max. score 



Score 



12 



33.33 % 



25 % 



2156 



492 



SEO ID NO: 



SEP ID NO: 



7962 



7963 



20 % 



16.66 % 



SEO ID NO: 



SEO ID NO: 



7964 



7965 



16.66 % 



16.66 % 



415 



2479 



SEO ID NO: 



SEQ ID NO: 



7966 



7967 



16.66 % 



11.11 % 



1495 



2035 



SEO ID NO: 



SEQ ID NO: 



7968 



7969 



11.11 % 



10 % 



1349 



1194 



SEO ID NO: 



SEQ ID NQ: 



7970 



7971 



8.33 % 



8.33 % 



1648 



96 



SEQ ID NQ: 



SEQ ID NQ: 



7972 



7973 



6.66 % 



6.66 % 



SEQ ID NO: 



SEO ID NO: 



7974 



7975 



6.66 % 



6.66 % 



2345 



698 



SEO ID NO: 



SEQ ID NO: 



7976 



7977 



5.55 % 



5.55 % 



1355 



1987 



2085 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



7978 



7979 



7980 



5.55 % 



5.55 % 



HLA A 1101 - 10 mers 



Maximum possible score using this molecule 



36 



Score 



Rank 



Start position 



Sequence 



of max. score 



33.33 % 



2083 



2123 



SEQ ID NO: 



SEO ID NO: 



7981 



7982 



25 % 



16.66 % 



2147 



331 



SEQ ID NO: 



SEO ID NO: 



7983 



7984 



12.5 % 



11.11 % 



1035 



1064 



SEQ ID NO: 



SEO ID NO: 



7985 



7986 



11.11 % 



11.11 % 



2154 



1048 



SEO ID NO: 



SEO ID NO: 



7987 



7988 



7.5 % 



6.66 % 



SEQ ID NO: 



SEO ID NO: 



7989 



7990 



6.66 % 



6.66 % 



2109 



2230 



SEO ID NO: 



SEQ ID NO: 



7991 



7992 



6.66 % 



5.55 % 



5.55 % 



1306 



1622 



SEO ID NO: 



SEO ID NO: 



7993 



7994 



- 416 - 
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1772 



1796 



SEO ID NO: 7995 



5.55 % 



5.55 % 



SEQ ID NO: 7996 



SEO ID NO: 7997 



SEQ ID NO: 7998 



SEO ID NO: 7999 



SEQ ID NO: 8000 



697 



1175 



HLA B7 - 9 mers 



usinq this molecule 



Maximum possible score 



5400 



Score 



Rank 



Start position 



1447 



% of max. score 



14.81 % 



SEQ ID NO: 



SEO ID NO: 



8001 



8002 



3.70 % 



2.22 % 



SEO ID NO: 



SEO ID NO: 



8003 



8004 



120 



i 80 



1.48 % 



1.48 % 



SEO ID NO: 



SEQ ID NO: 



8005 



8006 



459 



1475 



1.48 % 



1.48 % 



SEQ ID NO: 



SEO ID NO: 



8007 



8008 



1867 



2032 



1.48 % 



1.48 % 



SEQ ID NO: 



SEQ ID NO: 



8009 



8010 



2047 



2335 



1.48 % 



1.48 % 



SEQ ID NO: 



SEO ID NO: 



8011 



8012 



622 



1375 



1.11 % 



1.11 % 



SEQ ID NO: 



SEQ ID NO: 



8013 



8014 



1617 



1023 



0.92 % 



0.83 % 



SEO ID NO: 



SEQ ID NO: 



8015 



8016 



0.74 % 



0.74 % 



SEO ID NO: 



SEQ ID NO: 



8017 



8018 



810 



1420 



1854 



0.74 % 



0.74 % 



0.74 % 



SEQ ID NO: 



SEQ ID NO: 



8019 



8020 



HLA B7 - 10 mers 



Maximum possible score usinq this molecule 



5400 



Rank 



Start position 



Sequence 



SEO ID NO: 8021 



Vo of max. score 



3.70 % 



Score 



200 



1617 



752 



1552 



154 



165 



SEQ ID NO: 8022 



SEQ ID NO: 8023 



2.22 % 



2.22 % 



SEQ ID NO: 8024 



SEQ ID NO: 8025 



1.48 % 



1.48 % 



- 417 - 
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383 



1501 



SEO ID NO: 



SEQ ID NO: 



8026 



8027 



1.48 % 



1.48 % 



2093 



2564 



SEQ ID NO: 



SEQ ID NO: 



8028 



8029 



1.48 % 



1.48 % 



622 



1086 



1262 



1556 



SEQ ID NO: 



SEO ID NO: 



8030 



8031 



SEO ID NO: 



SEQ ID NO: 



8032 



8033 



1.11 % 



1.11 % 



1.11 % 



1.11 % 




SEO ID NO: 



SEQ ID NO: 



8034 



8035 



1 % 



0.74 % 



SEO ID NO: 



SEQ ID NO: 



8036 



8037 



0.74 % 



0.74 % 



1858 



2107 



2582 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8038 



8039 



8040 



0.74 % 



0.74 % 



Table 16: Epitop^ for SEQ ID NO: 6042 




HLA A1 - 9 mers 



Maximum possible score using this molecule tvoe 



Start position 



846 



Sequence 



1178 



637 



557 



1020 



282 



1241 



SEQ ID NO: 



SEQ ID NO: 



8041 



8042 



SEQ ID NO: 



SEQ ID NO: 



8043 



8044 



SEQ ID NO: 



SEQ ID NO: 



8045 



8046 



SEQ ID NO: 



SEO ID NO: 



SEQ ID NO: 



8047 



8048 



SEQ ID NO: 



SEO ID NO: 



8049 



8050 



SEQ ID NO: 



SEQ ID NO: 



8051 



8052 



SEQ ID NO: 



SEQ ID NO: 



8053 



8054 



SEQ ID NO: 



SEQ ID NO: 



8055 



8056 



8057 



% of max. score 



2.22 % 



1.6 % 



0.88 % 



0.88 % 



0.44 % 



0.44 % 



0.32 % 



0.24 % 



0.22 % 



0.2 % 



0.17 % 



0.16 % 



0.16 % 



0.13 % 



0.13 % 



0.12 % 





^ 18 - 































































































































wo 2004/092360 
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SEO ID NO: 8058 



SEQ ID NO: 8059 



SEQ ID NO: 8060 



0.08 % 



0.08 % 



0.08 % 



HLA A1 - 10 mers 



using this molecule 



Maximum possible score 



5625 



Score 



Rank 



Start position 



1241 



Sequence 



% of max. score 



4.8 % 




967 



1010 



426 



SEQ ID NO: 



SEQ ID NO: 



8061 



8062 



SEQ ID NO: 



SEQ ID NO: 



8063 



8064 



0.8 % 



0.48 % 



0.44 % 



809 



1178 



SEQ ID NO: 



SEQ ID NO: 



8065 



8066 



0.44 % 



0.44 % 




SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8067 



8068 



0.22 % 



0.22 % 



12.5 



12.5 



8069 



8070 



0.2 % 



0.17 % 



11.25 



10 



12 



1181 



SEO ID NO: 



SEQ ID NO: 



8071 



8072 



0.13 % 



0.12 % 



SEO ID NO: 



SEQ ID NO: 




SEQ ID NO: 



SEQ ID NO: 



8073 



8074 



0.11 % 



0.11 % 




SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8075 



8076 



0.04 % 



0.04 % 



8077 



8078 



0.04 % 



0.04 % 



6.75 



6.25 



6.25 



2.5 



SEQ ID NO: 



SEQ ID NO: 



8079 



8080 



0.04 % 



0.04 % 



HLA A3 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



768 



808 



85 



663 



Sequence 



SEQ ID NO: 8081 



SEO ID NO: 8082 



SEQ ID NO: 8083 



SEO ID NO: 8084 



% of max. score 



0.82 % 



0.49 % 



0.24 % 



0.24 % 



12150 



Score 



100 



1245 



288 



SEQ ID NO: 8085 



0.14 % 



0.09 % 



SEQ ID NO: 8086 



SEQ ID NO: 8087 



SEQ ID NO: 8088 



0.08 % 



0.07 % 



- 419 - 
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SEO ID NO; 



SEQ ID NO; 



8089 



8090 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8091 



8092 



0.06 % 



0.05 % 



8.1 



6;75 



SEQ ID NO: 



SEQ ID NQ: 



8093 



8094 



0.04 % 



0.04 % 



SEQ ID NO: 



SEQ ID NO: 



8095 



8096 



0.03 % 



0.03 % 



706 



1204 



SEQ ID NO: 



SEQ ID NO: 



8097 



8098 



0.03 % 



0.03 % 



4.05 



4.05 



1178 



343 



SEQ ID NQ: 



SEQ ID NO: 



8099 



8100 



0.03 % 



0.02 % 



HLA A3 - 10 mers 



using this molecule 



Maximum possible score 



12150 



Score 



Rank 



Start position 



255 



Sequence 



9^ of max. score 



1.48 % 



SEO ID NO: 



SEQ ID NO: 



8101 



8102 



180 



67.5 



0.55 % 



0.49 % 



SEQ ID NO: 



SEQ ID NQ: 



8103 



8104 



1177 



380 



0.49 % 



0.24 % 



SEQ ID NO: 



SEQ ID NO: 



8105 



8106 



30 



22.5 



0.18 % 



0.16 % 



SEO ID NO: 



SEQ ID NO: 



8107 



8108 



1217 



207 



0.16 % 



0.14 % 



SEO ID NO: 



SEQ ID NO: 



8109 



8110 



1183 



38 



0.14 % 



0.09 % 



SEQ ID NO: 



SEQ ID NO: 



8111 



8112 



0.09 % 



0.06 % 



SEQ ID NO: 



SEQ ID NO: 



8113 



8114 



0.06 % 



0.05 % 






6.75 



6.075 



1204 



50 



SEQ ID NO: 8116 



0.05 % 



0.04 % 



SEQ ID NO: 8117 



SEQ ID NO: 8118 



0.04 % 



0.04 % 



0.04 % 



SEO ID NO: 8119 



SEQ ID NO: 8120 



HLA A24 - 9 mers 



Maximum possible score using this molecule 



1596.672 



- 420 - 







































































































































Rank Start position 



1118 



620 



1068 



E=iar.ij[.a=tOTii 



% of max, score || Score 



19.84% II 316.8 









18.78 % 



18.78 % 



SEQ ID NO: 8124 






18.78 % 



13.77 % 






12.52 % 



7.51 % 



SEQ ID NO: 8128 






7.51 % 



3.75 % 



SEQ ID NO: 8130 






3.44 % 



3.13 % 









2.34 % 



1.87 % 



1166 



45 



185 



SEQ ID NO: 8134 



■agai.ntta;tnH 



1.75 % 



1.56 % 






SEQ ID NO: 8137 



SEQ ID NO: 8138 



1.56 % 



1.31 % 






SEQ ID NO: 8140 



1.26 % 



1.25 % 



1.25 % 




HLA A24 - 10 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



438 



1047 



473 



SEQ ID NO: 8141 









SEQ ID NO: 8144 






SEQ ID NO: 8146 



SEQ ID NO: 8147 



SEQ ID NO: 8148 



SEQ ID NO: 8149 



SEQ ID NO: 8150 






of max. score 



27.55 % 



22.54 % 



18.78 % 



11.27 % 



11.27 % 



11.27 % 



7.51 % 



6.26 % 



6.26 % 



5.63 % 






3.13 % 



2.70 % 



SEQ ID NO: 8153 



SEQ ID NO: 8154 



1.87 % 



1.50 % 




37.5 



30 



20.16 



20 



20 



1596.672 



Score 




50 



43.2 





All- 
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782 



1186 



183 



1069 






SEQ ID NO: 8156 






SEQ ID NO: 8160 



1.50 % 



1.31 % 



1.05 % 



0.93 % 



21 



16.8 



0.93 % 



0.93 % 



Maximum possible 



Rank 



HLA A 0201 - 9 mers 



score using this molecule 






1041 



981 



1173 



733 












SEQ ID NO: 8164 









SEQ ID NO: 8168 






595 



1095 



1166 



845 



1213 



105 



SEQ ID NO: 8170 



SEQ ID NO: 8171 









SEQ ID NO: 8174 






SEQ ID NO: 8176 






SEQ ID NO: 8178 



SEO ID NO: 8179 



SEQ ID NO: 8180 



% of max. score 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



484.2379773 



382.536 



342.4606344 



232.6931712 



201.447432 



171.86796 



119.463012 



102.17550222 









89.41779 



87.58584 



79.642008 



71.872056 



50.232 



49.13352 



48.732354 



HLA A 0201 - 10 mers 



Maximum possible score using this molecule type 



Rank 






162 



1076 



1196 












SEQ ID NO: 8184 






% of max. score 



0.04 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



3925227.1 



Score 



1896.33528 



1134.00849744 



685.3897512 



640.90320525 



623.742666372 




^ 22 - 
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HLA A 1101 - 10 mers 



using this molecuie 



Maximum possibie score 



36 



Score 



4.8 



Rank 



Start position 



38 



Sequence 



of max. score 



13.33 % 



SEO ID NO: 



SEO ID NO: 



8221 



8222 



12.5 % 



11.11 % 



SEO ID NO: 



SEO ID NO: 



8223 



8224 



11.11 % 



10 % 



SEQ ID NO: 



SEO ID NO: 



8225 



8226 



8.33 % 



6.66 % 



SEO ID NO: 



SEO ID NO: 



8227 



8228 



5.55 % 



5 % 



SEO ID NO: 



SEO ID NO: 



8229 



8230 



3.33 % 



3.33 % 



SEO ID NO: 



SEO ID NO: 



8231 



8232 



1177 



429 



SEO ID NO: 



SEQ ID NO: 



8233 



8234 



2.77 % 



2.5 



SEQ ID NO: 



SEO ID NO: 



8235 



8236 



2.22 % 



2.22 % 



SEO ID NO: 



SEO ID NO: 



8237 



8238 



1157 



170 



893 



2.22 % 



1.66 % 



1.66 % 



SEQ ID NO: 



SEQ ID NO: 



8239 



8240 



HLA B7 - 9 mers 



Maximum possible score using this molecule 



5400 



Score 



Rank 



Start position 



200 



Sequence 



SEO ID NO: 8241 



% of max. score 



1.48 % 



1243 



123 



SEQ ID NO: 



SEQ ID NO: 



8242 



8243 



1.48 % 



0.74 % 



248 



1036 



SEO ID NO: 



SEO ID NO: 



8244 



8245 



0.66 % 



0.66 % 



SEQ ID NO: 



SEO ID NO: 



SEQ ID NO: 



8246 



8247 



0.37 % 



0.37 % 



8248 



8249 



0.37 % 



0.37 % 



SEQ ID NO; 



SEQ ID NO: 



SEQ ID NO: 



8250 



8251 



0.37 % 



0.33 % 



- 424 - 
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PCT/US2004/011710 
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Start position 



Sequence 



SEQ ID NO: 8281 



9/6 of max. score H Score 



0.2 % D 11.25 



SEQ ID NO: 



SEQ ID NO: 



8282 



8283 



0.08 % 



0.08 % 



SEQ ID NO: 



SEQ ID NO: 



8284 



8285 



0.04 % 



0.04 % 



SEQ ID NO: 



SEQ ID NO: 



8286 



8287 



0.04 % 



0.02 % 



SEQ ID NO: 



SEQ ID NO: 



8288 



8289 



0.02 % 



0.02 % 



SEQ ID NO: 



SEQ ID NO: 



8290 



8291 



0.01 % 



0.01 % 



SEQ ID NO: 



SEQ ID NO: 



8292 



8293 



0.01 % 



0.01 % 



SEQ ID NO: 



SEQ ID NO: 



8294 



8295 



0.01 % 



0.00 % 



SEQ ID NO: 



SEQ ID NO: 



8296 



8297 



0.00 % 



0.00 % 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8298 



8299 



8300 



0.00 % 



0.00 % 



0.00 % 



HLA A1 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 8301 



SEQ ID NO: 



SEQ ID NO: 



8302 



8303 



SEQ ID NO: 



SEQ ID NO: 



8304 



8305 



SEQ ID NO: 



SEQ ID NO: 



8306 



8307 



SEQ ID NO: 



SEQ ID NO: 



8308 



8309 



SEQ ID NO: 



SEQ ID NO: 



8310 



8311 



SEQ ID NO 



SEQ ID NO 



SEQ ID NO 



8312 



8313 



8314 




of max. score 



0.8 % 



0.8 % 



0.48 % 



0.2 % 



0.17 % 



0.13 % 



0.04 % 



0.04 % 



0.04 % 



0.02 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



2.25 



1.35 



1.25 



1.25 



0.625 



0.625 



5625 



Score 



27 



11.25 



2.25 



1.25 



1.25 



1.25 



- 426 - 
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SEO ID NO: 8315 



SEQ ID NO: 8316 



SEQ ID NO: 8317 



0.01 % 



0.00 % 



0.75 



0.00 % 



0.00 % 



SEQ ID NO: 8318 



SEQ ID NO: 8319 



SEQ ID NO: 8320 



0.00 % 



0.00 % 



HLA A3 - 9 mers 



Maximum possible score usinq this molecule 



12150 



Score 



Rank 



Start position 



104 



Sequence 



SEO ID NO: 8321 



‘Yo of max. score 



0.98 % 




SEQ ID NO: 



SEQ ID NO: 



8322 



8323 



0.74 % 



0.44 % 



54 



13.5 



SEQ ID NO: 



SEQ ID NO: 



8324 



8325 



0.11 % 



0.08 % 



10.8 



10 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8326 



8327 



8328 



0.08 % 



0.07 % 



0.06 % 



0.04 % 






SEQ ID NO: 



SEQ ID NO: 



8330 



8331 



0.03 % 



0.03 % 



4.05 



3 



3 



SEQ ID NO: 



SEQ ID NO: 



8332 



8333 



0.02 % 



0.02 % 



SEQ ID NO: 



SEO ID NO: 



8334 



8335 



0.02 % 



0.02 % 



SEQ ID NO: 



SEQ ID NO: 



8336 



8337 



0.02 % 



0.01 % 



1.8 



1.215 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8338 



8339 



8340 



0.01 % 



0.00 % 



0.00 % 



HLA A3 - 10 mers 



Maximum possible score using this molecule 



12150 



Score 



Rank 



Start position 



Sequence 



SEO ID NO: 8341 



% of max. score 



1.33 % 



SEQ ID NO: 8342 



SEQ ID NO: 8343 



0.44 % 



0.24 % 



30 



20 



13.5 



SEO ID NO: 8344 



SEQ ID NO: 8345 



0.16 % 



0.11 % 



^ 27 - 
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SEO ID NO: 



SEQ ID NO: 



8346 



8347 



SEQ ID NO: 



SEQ ID NO: 



8348 



8349 



SEQ ID NO: 



SEQ ID NO: 



8350 



8351 



0.07 % 



0.04 % 



0.04 % 



0.03 % 



SEO ID NO: 



SEO ID NO: 



8352 



8353 



SEQ ID NO: 



SEQ ID NO: 



8354 



8355 



SEO ID NO: 



SEQ ID NO: 



8356 



8357 



SEO ID NO: 



SEO ID NO: 



SEQ ID NO: 



8358 



8359 



8360 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



Maximum possible score 



Rank I Start position 



HLA A24 - 9 mers 



using this molecule 






Vo of max. score 



13.15 % 









9.39 % 



4.69 % 



SEO ID NO: 8364 






3.15 % 



2.63 % 









2.10 % 



1.87 % 






SEQ ID NO: 8369 



SEO ID NO: 8370 



0.65 % 



0.56 % 



■a I rt 



0.55 % 



0.54 % 









0.50 % 



0.50 % 









0.46 % 



0.45 % 






SEO ID NO: 8377 



0.37 % 



0.37 % 






SEQ ID NO: 8380 



0.37 % 



0.37 % 



0.37 % 



1.8 



1.35 



1.35 



0.9 



0.81 



1596.672 



Score 



75 



50.4 



42 



33.6 



9 



8.8704 



8.64 



8 



8 



- 428 - 
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Maximum possible score 



Rank 



Start position 



105 



HLA A24 • 10 mers 



usinq this molecule 



% 






of max. score 



22.54 % 



SEQ ID NO: 8382 






17.53 % 



3.13 % 



SEO ID NO: 8384 






1.87 % 



1.87 % 



SEQ ID NO: 8386 






1.12 % 



0.84 % 









0.56 % 



0.55 % 



SEO ID NO: 8390 






0.54 % 



0.46 % 



SEQ ID NO: 8392 






0.45 % 



0.43 % 



SEQ ID NO: 8394 



0.37 % 



0.37 % 















SEQ ID NO: 8400 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



1596.672 



Score 



13.44 



9 



8.8704 



8.64 



.5 



HLA A 0201 - 9 mers 



Maximum possible score usinq this molecule 



Rank 






SEO ID NO: 



SEO ID NO: 



8401 



SEQ ID NO: 



SEQ ID NO: 



8402 



8403 



SEO ID NO: 



SEO ID NO: 



8404 



8405 



SEO ID NO: 



SEQ ID NO: 



8406 



8407 



SEO ID NO: 



SEQ ID NO: 



SEO ID NO: 



8408 



8409 



8410 



8411 






% of max. score 



0.21 % 



0.10 % 



0.03 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



8532.082944 



3977.8497792 



1243.078056 



348.872832 



243.432 



130.26096 



91.182672 



,73.342584 



70.386624 



65.32728732 



47.876409 



- 429 - 

































































































































wo 2004/092360 



PCT/US2004/011710 



SEO ID NO: 8412 



SEQ ID NO: 8413 



SEO ID NO: 8414 






SEO ID NO: 8416 



SEO ID NO: 8417 



SEO ID NO: 8418 



SEQ ID NO: 8419 



SEQ ID NO: 8420 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



36.8637882 



30.889782 



21.616753941 



19.657134 



18.4318941 



16.496865 



15.783256167 



13.9968225 



10.31851392 



Maximum possible 



HLA A 0201 - 10 



score using this molecule 



mers 



Rank 






SEQ ID NO: 8421 





















SEQ ID NO: 8428 






SEQ ID NO: 8430 



SEQ ID NO: 8431 









SEQ ID NO: 8434 






SEQ ID NO: 8436 



SEO ID NO: 8437 



SEO ID NO: 8438 



SEQ ID NO: 8439 



SEQ ID NO: 8440 



of max. score 



0.01 % 



3925227.1 



Score 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



556.494246 



395.245972224 



201.554244 



143.65707264 



132.54624 



84.78671286 



69.552 



52.7237901 



47.99088 



37.4509575 



31.24872 



31.1384304 



27.531 



22.9153278 



22.1063618985 



21.55457052 



21.3624 



19.657134 



19.657134 



13.87622016 



HLA A 1101 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



58 



125 



Sequence 



SEQ ID NO: 8441 



SEQ ID NO: 8442 



% of max. score 



5.55 % 



1.66 % 



36 



Score 



^ 30 - 
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SEO ID NO: 8443 



SEP ID NO: 8444 



1.66 % 



1.66 % 



HLA A 1101 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEO ID NO: 8445 



SEO ID NO: 8446 



% of max. score 



2.22 % 



2.22 % 



HLA B7 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEO ID NO: 8447 



SEO ID NO: 8448 



SEO ID NO: 8449 



SEQ ID NO: 8450 



SEO ID NO: 8451 






SEO ID NO: 



SEQ ID NO: 



8453 



8454 



% of max. score 



0.66 % 



0.37 % 



0.33 % 



0.33 % 



0.22 % 



SEQ ID NO: 



SEQ ID NO: 



8455 



8456 



SEO ID NO: 



SEQ ID NO: 



8457 



8458 



SEQ ID NO: 



SEQ ID NO: 



8459 



8460 



SEO ID NO: 



SEQ ID NO: 



8461 



8462 



SEQ ID NO: 



SEO ID NO: 



8463 



8464 



SEQ ID NO: 



SEQ ID NO: 



8465 



8466 



0.22 % 



0.11 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.03 % 



0.03 % 



0.03 % 



HLA B7 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEO ID NO: 8467 



SEQ ID NO: 8468 



SEO ID NO: 8469 



% of max. score 



3.33 % 



1.48 % 



0.66 % 



36 



Score 



5400 



Score 



5400 



Score 



^ 31 - 

























































































































SEO ID NO: 



SEQ ID NO: 



SEO ID NO: 



SEO ID NO: 



8470 



8471 



8472 



8473 



0.37 % 



0.37 % 



0.22 % 



0.22 % 





SEO ID NO: 



SEQ ID NO: 



8474 



8475 



SEQ ID NO: 



SEQ ID NO: 



8476 



8477 



0.22 % 



0.22 % 



0.22 % 



0.14% 




SEQ ID NO: 



SEQ ID NQ: 



8478 



8479 



SEQ ID NQ: 



SEQ ID NQ: 



8480 



8481 



SEQ ID NQ: 



SEQ ID NO: 



8482 



8483 




SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8484 



8485 



8486 



0.14% 



0.14 % 



0.11 % 



0.11 % 



0.11 % 



0.07% 



0.07 % 



0.07 % 



Table 18: Epitopes for SEQ ID NO: 6044 



HLA A1 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



69 



89 



Sequence 



SEQ ID NO: 8487 



SEQ ID NO: 8488 



SEQ ID NO: 8489 



SEQ ID NO: 8490 



9/6 of max. score 



0.04 % 



0.02 % 



0.01 % 



0.00 % 



HLA A1 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



SEQ ID NO: 8491 



SEQ ID NO: 8492 



SEQ ID NO: 8493 



SEQ ID NO: 8494 



SEQ ID NO: 8495 



Vo of max. score 



0.02 % 



0.02 % 



0.02 % 



0.00 % 



0.00 % 



HLA A3 - 9 mers 



5625 



Score 



5625 



Score 



1.25 



0.5 



0.5 



- 432 - 
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Maximum possible score using this molecule type 



12150 



Score 



Rank 



Start position 



Sequence 



SEO ID NO: 8496 



% of max. score 



1.23 % 



SEQ ID NO: 



SEO ID NO: 



8497 



8498 



1.11 % 



0.49 % 



SEO ID NO: 



SEO ID NO: 



8499 



8500 



0.32 % 



0.08 % 



SEQ ID NO: 



SEQ ID NO: 



8501 



8502 



0.04 % 



0.01 % 



SEQ ID NO: 



SEQ ID NO: 



8503 



8504 



0.01 % 



0.01 % 



SEQ ID NQ: 



SEQ ID NO: 



8505 



8506 



0.00 % 



0.00 % 



SEQ ID NO: 



SEO ID NO: 



8507 



8508 



0.00 % 



0.00 % 



SEQ ID NQ: 



SEQ ID NQ: 



8509 



8510 



0.00 % 



0.00 % 



SEQ ID NQ: 



SEQ ID NQ: 



SEQ ID NQ: 



8511 



8512 



8513 



0.00 % 



0.00 % 



0.00 % 



HLA A3 - 10 mers 



Maximum possible score using this molecule 



12150 



Score 



Rank 



Start position 



Sequence 



SEQ ID NQ: 8514 



% of max. score 



0.49 % 



SEQ ID NQ: 



SEQ ID NO: 



8515 



8516 



0.16 % 



0.16 % 



SEO ID NO: 



SEQ ID NO: 



8517 



8518 



0.12 % 



0.12 % 



SEQ ID NO: 



SEQ ID NO: 



8519 



8520 



0.04 % 



0.04 % 



SEQ ID NO: 



SEQ ID NO: 



8521 



8522 



0.04 % 



0.04 % 



SEO ID NO: 



SEQ ID NO: 



8523 



8524 



0.04 % 



0.04 % 



SEQ ID NO: 



SEO ID NO: 



8525 



8526 



0.03 % 



0.01 % 



SEQ ID NO: 



SEQ ID NO: 



8527 



8528 



0.01 % 



0.01 % 



^ 33 - 
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SEO ID NO: 8529 



SEQ ID NO: 8530 



SEO ID NO: 8531 



SEO ID NO: 8532 



SEO ID NO: 8533 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



HLA A24 - 9 mers 



Maximum possible score using this molecule 



Rank 




Start; position 



Sequence 






^/o of max. score 



14.46 % 









3.13 % 



1,87 % 









■d=MiJi!i.a:aka 



0.37 % 



0.37 % 



0.37 % 






SEQ ID NO: 8544 



Ka^>i.ii!i«a:Hi.aii 






0.35 % 



0.35 % 



0.33 % 



Bgj.K.iir.a:i.-eH 



■4J.njii.a=aiii 



0.30 % 






E55BaJMl;UfcWi 






0.25 % 



0.22 % 



1596.672 



Score 





5.76 



5.6 



5.6 



5.28 




HLA A24 - 10 mers 



Maximum possible score using this molecule 




- 434 - 
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SEQ ID NO: 8564 



K=t»najt>gmg 



SEO ID NO: 8566 






E=(ai»lJ[»g:U4Hg 






SEQ ID NO: 8570 









SEQ ID NO: 8573 



0.45 % 



0.41 % 



0.37 % 



0.30 % 



0.30 % 



0.30 % 



0.30 % 



0.27 % 



0.25 % 



0.25 % 



0.25 % 



0.25 % 



0.25 % 



0.25 % 



Maximum possible 



HLA A 0201 - 9 mers 



score using this molecule 



Rank 
























SEQ ID NO: 8579 



SEO ID NO: 8580 









SEQ ID NO: 8583 









<*/o of max. score 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



i5ijgTi>M»g#a 









SEQ ID NO: 8589 



SEQ ID NO: 8590 









SEQ ID NO: 8593 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



136.1646 



69.6969 



60.153786 



52.5182736 



49.13352 



21.99582 



19.42488 



14.6900655 



11.304684 



8.4687081 



6.0996 



5.382 



4.981593 



4.968 



4.966482 



4.4815221585 



4.128201 



3.67632 



3.47553 



3.30993 
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Maximum possible 



HLA A 0201 - 10 mers 



score usinq this molecule 



Rank 















E151BI 



mm 


















SEQ ID NO: 



SEQ ID NO: 



8600 



8601 



SEQ ID NO: 



SEO ID NO: 



8602 



8603 



SEQ ID NO: 



SEQ ID NO: 



8604 



8605 



SEQ ID NO: 



SEO ID NO: 



8606 



8607 



SEQ ID NO: 



SEO ID NO: 



8608 



8609 



SEQ ID NO: 



SEO ID NO: 



8610 



8611 









% of max. score 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



147.97188 



143.59176 



106.83684 



83.526984 



83.526984 



69.552 



61.06776 



26.082 



18.3816 



17.38386 



17.1396 



16.21914 



14.02908 



11.161854 



10.34586 



8.846145 



7.575080337 



7.452 



6.735366 



HLA A 1101 - 9 mers 



Maximum possible score usinq this moiecule 



Rank 



Start position 



Sequence 



SEO ID NO: 8614 



SEQ ID NO: 8615 



SEQ ID NO: 8616 



SEO ID NO: 8617 



SEQ ID NO: 8618 



SEQ ID NO: 8619 



SEQ ID NO: 8620 



9^0 of max. score 



5.55 % 



3.33 % 



2.22 % 



2.22 % 



1.66 % 



36 



Score 



HLA A 1101 - 10 mers 



Maximum possibie score usinq this moiecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 8621 



% of max. score 



36 



Score 



- 436 - 
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SEQ ID NO: 8622 



SEQ ID NO: 8623 



3.33 % 



3.33 % 



SEQ ID NO: 8624 



SEQ ID NO: 8625 



2.77 % 



2.5 % 



HLA B7 • 9 mers 



Maximum possible score using this molecule 



5400 



Score 



Rank Start position 



Sequence 



SEQ ID NO: 8626 



"/o Of max. score 



0.74 % 



SEQ ID NO: 



SEQ ID NO: 



8627 



8628 



0.37 % 



0.33 % ^ 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8629 



8630 



SEQ ID NO: 



SEQ ID NO: 



8631 



8632 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8633 



8634 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8635 



8636 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8637 



8638 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8639 



8640 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8641 



8642 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8643 



8644 



8645 



0.03 % 



0.03 % 



0.02 % 



HLA B7 - 10 mers 



Maximum possible score using this molecule 



5400 



Score 



Rank 



Start position 



Sequence 



SEQ ID NO: 8646 



% of max. score 



1.48 % 



0.74 % 



SEQ ID NO: 8647 



SEQ ID NO: 8648 



0.37 % 



0.22 % 



SEQ ID NO: 8649 



SEQ ID NO: 8650 



0.11 % 



0.09 % 



0.07 % 




SEQ ID NO: 8651 



SEO ID NO: 8652 



- 437 - 
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SEO ID NO: 



SEQ ID NO: 



8653 



8654 



0.07 % 



0.07 % 



SEO ID NO: 



SEO ID NO: 



8655 



8656 



SEO ID NO: 



SEQ ID NO: 



8657 



8658 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



SEO ID NO: 



SEQ ID NO: 



8659 



8660 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8661 



8662 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8663 



8664 



8665 



0.07 % 



0.07 % 



0.03 % 



Table 19: Epitopes for SEQ ID NO: 6045 



■ HLAAl-9mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 8666 



SEQ ID NO: 



SEQ ID NO: 



8667 



8668 



SEQ ID NO: 



SEQ ID NO: 



8669 



8670 



SEQ ID NO: 



SEO ID NO: 



SEQ ID NO: 



8671 



8672 



8673 



^/o of max. score 



0.02 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



HLA A1 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



SEO ID NO: 8674 



SEO ID NO: 8675 



SEO ID NO: 8676 



SEQ ID NO: 8677 



SEQ ID NO: 8678 



SEQ ID NO: 8679 



% of max. score 



0.12 % 



0.12 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



5625 



Score 



1.35 



1.35 



1.25 



0.75 



0.75 



0.5 



5625 



Score 



6.75 



6.75 



0.5 




- 438 - 






































































































wo 2004/092360 



PCT/US2004/011710 



HLA A3 - 9,mers 



Maximum possible score using this molecule 



Rank 



Start position 



17 



Sequence 



SEQ ID NO: 8680 



% of max. score 



0.24 % 





SEQ ID NO: 



SEQ ID NO: 



8681 



8682 



0.07 % 



0.06 % 



SEQ ID NO: 



SEQ ID NO: 



8683 



8684 



SEQ ID NO: 



SEQ ID NO: 



8685 



8686 



SEQ ID NO: 



SEQ ID NO: 



8687 



8688 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8689 



8690 



8691 



8692 



0.04 % 



0.03 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



HLA A3 - 10 mers 



Maximum possible score using this molecule 



Rank 




Start position 



49 



Sequence 



SEQ ID NO: 8693 



SEQ ID NO: 



SEQ ID NO: 



8694 



8695 




SEQ ID NO: 



SEQ ID NO: 



8696 



8697 




SEQ ID NO: 



SEO ID NO: 



8698 



8699 



SEQ ID NO: 



SEQ ID NO: 



8700 



8701 



% of max. score 



0.44 % 



0.22 % 



0.14 % 



0.07 % 



0.04 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



HLA A24 - 9 mers 



Maximum possible score using this molecule 



12150 



Score 



3.24 



3 



1.8 



1.8 



1.35 



12150 



Score 




1596.672 
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SEQ ID NO: 8708 



SEQ ID NO: 8709 



SEQ ID NO: 8710 



SEQ ID NO: 8711 



0.52 % 



0.52 % 



0.51 % 



0.48 % 



8.4 



8.4 



8.25 



SEQ ID NO: 8712 



SEQ ID NO: 8713 



SEQ ID NO: 8714 



SEQ ID NO: 8715 



SEQ ID NO: 8716 






0.45 % 



0.37 % 



0.37 % 



0.37 % 



0.35 % 



0.31 % 




SEQ ID NO: 8718 






SEQ ID NO: 8720 






0.30 % 



0.30 % 



0.27 % 



0.25 % 



4.8 



4.8 



4.32 



HLA A24 - 10 mers 



Maximum possible score using this molecule 



Rank II Start position 



55 



2 18 



3 n 21 




■aa.ii.ii!i.a;v»i 



SEQ ID NO: 8723 



reatn.ui.a;fcai 



% of max. score 



18.78 % 



2.63 % 



2.25 % 



1596.672 



Score 






1.87 % 



SEQ ID NO: 8726 







SEQ ID NO: 8728 







0.52 % 



0.42 % 



8.4 



6.72 



SEQ ID NO: 8730 






0.37 % 



0.35 % 




0.35 % 



0.35 % 



0.31 % 



0.25 % 






SEQ ID NO: 8737 



Kaai.ii!i.a;fcfci3 




0.25 % 



0.25 % 



0.22 % 



0.18 % 





SEQ ID NO: 8740 



SEQ ID NO: 8741 



0.18 % 



0.08 % 




440 - 
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Maximum possible 



Rank 



HLA A 0201 - 9 mers 



score using this molecule 







SEO ID NO: 8743 



SEQ ID NO: 8744 






SEQ ID NO: 8746 



SEO ID NO: 8747 



SEQ ID NO: 8748 



SEQ ID NO: 8749 



SEQ ID NO: 8750 












Ba:> TTa7r «]i:wa 






SEQ ID NO: 8756 






LHd>ij»i.u>a;kaa 






SEQ ID NO: 8760 



SEQ ID NO: 8761 



Wo of max. score 



0.03 % 



0.02 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



1310.8823136 



1082.4143022 



1040.33238624 



382.536 



342.863529264 



63.28397376 



40.19736105 



21.3624 



0.00 % 



0.00 % 



0.00 % 



19.7762418 



12.6684 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



11.544666 



10.4328 



8.4456 



6.2888049 



6.0858 






4.4855150505 






2.88098391 



1.699677 



HLA A 0201 - 10 mers 



Maximum possible score using this molecule 



Rank 






Seauence 









ID NQ: 8764 


















SEQ ID NO: 8770 



SEO ID NO: 8771 






■aa>ai»akntM;y#ici 



% of max. score 



0.16 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



6459.14167272 



607.88448 



126.83304 



63.16728165 



53.54651988 



28.51632 



21.8247414 



21.3624 



19.42488 



17.2167282 



15.7068219 



9.8581266 
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wo 2004/092360 



PCT/US2004/011710 









SEO ID NO: 8776 



0.00 % 



0.00 % 






0.00 % 



0.00 % 



E glUin gi 3riri:| 






SEQ ID NO: 8780 



SEQ ID NO: 8781 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



7.3086111 



7.10976 



5.7419523 



4.599126 



2.5495155 



2.52747 



2.383605 



2.332847151 



HLA A 1101 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 8782 



**/o of max. score 



3.33 % 



HLA A 1101 - 10 mers 



Maximum possible score using this hriolecule 



Rank 



Start position 



% of max. score 




HLA B7 - 9 mers 



Maximum possible score using this molecule 



Rank 



1 . 



Start position 



Sequence 



SEQ ID NO: 8783 



SEQ ID NO: 



SEQ ID NO: 



8784 



8785 



SEQ ID NO: 



SEQ ID NO: 



8786 



8787 



SEO ID NO: 



SEQ ID NO: 



8788 



8789 



SEQ ID NO: 



SEQ ID NO: 



8790 



8791 



SEQ ID NO: 



SEQ ID NO: 



8792 



8793 



SEO ID NO: 



SEQ ID NO: 



8794 



8795 



SEQ ID NO: 



SEQ ID NO: 



8796 



8797 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8798 



8799 



8800 



% of max. score 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



0.22 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.05 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



Score 



5400 



Score 



- 442 - . 





























































































































SEP ID NO: 8801 



0.00 % 



HLA B7 - 10 mers 



Maximum possible score usinq this molecule 



5400 



Score 



20 



Rank Start position 



27 



Sequence 



SEP ID NO: 8802 



9/6 of max. score 



0.37 % 



SEP ID NO! 



SEQ ID NO: 



8803 



8804 



0.07 % 



0.07 % 



SEO ID NO: 



SEO ID NO: 



8805 



8806 



0.07 % 



0.07 % 



SEO ID NO: 



SEQ ID NO: 



8807 



8808 



0.07% 



0.07 % 



SEO ID NO: 



SEQ ID NO: 



8809 



8810 



0.07 % 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



8811 



8812 



0.03 % 



0.03 % 



SEQ ID NO: 



SEQ ID NO: 



8813 



8814 




0.02 % 



0.01 % 



SEQ ID NO: 



SEQ ID NO: 



8815 



8816 



0.01 % 



0.01 % 



SEQ ID NO: 



SEQ ID NO: 



8817 



8818 



0.00 % 



0.00 % 



0.00 % 



SEO ID NO: 



SEQ ID NO: 



8819 



8820 



Table 20: Epitopes for SEQ ID NO: 6046 



Maximum possible score usinq this molecule tvoe 



Start position 




HLA A1 > 9 mers 



Sequence 



SEQ ID NO: 8821 



SEQ ID NO: 



SEQ ID NO: 



8822 



8823 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8824 



8825 



8826 



8827 



8828 



% of max. score 



2.22 % 



0.88 % 



0.08 % 



0.04 % 



0.04 % 



0.04 % 



0.04 % 



0.02 % 



5625 



Score 



2.5 



2.5 



1.35 











































































































wo 2004/092360 



PCT/DS2004/011710 



SEO ID NO: 



SEQ ID NO: 



8829 



8830 



SEO ID NO: 



SEO ID NO: 



8831 



8832 



SEO ID NO: 



SEO ID NO: 



8833 



8834 



SEO ID NO: 



SEO ID NO: 



SEO ID NO: 



8835 



8836 



8837 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



HLA A1 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 8838 



SEQ ID NO: 8839 



SEQ ID NO: 8840 



SEO ID NO: 8841 



SEQ ID NO: 8842 



SEQ ID NO: 8843 



SEQ ID NO: 8844 






SEQ ID NO: 



SEQ ID NO: 



8846 



8847 



SEQ ID NO: 



SEO ID NO: 



8848 



8849 



SEQ ID NO: 



SEO ID NO: 



SEQ ID NO: 



8850 



8851 



8852 



o/o of max. score 



0.04 % 



0.04 % 



0.04 % 



0.04 % 



0.04 % 



0.03 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



HLA A3 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



43 



Sequence 



SEQ ID NO: 8853 



SEQ ID NO: 8854 



SEQ ID NO: 8855 






SEO ID NO: 8857 



SEQ ID NO: 8858 



SEQ ID NO: 8859 



% of max. score 



0.24 % 



0.09 % 



0.05 % 



0.04 % 



0.04 % 



0.04 % 



1.125 



0.625 



0.5 



5625 



Score 



2.5 



2.25 



1.8 



1.25 



1.125 



0.75 



0.625 



12150 



Score 



12 



6.75 



6 
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wo 2004/092360 



PCT/OS2004/011710 



SEO ID NO: 8860 



SEQ ID NO: 8861 



SEQ ID NO: 8862 






SEQ ID NO: 



SEQ ID NO: 



8864 



8865 



SEQ ID NO: 



SEQ ID NO: 



8866 



8867 



SEQ ID NO: 



SEO ID NO: 



8868 



8869 



SEQ ID NO: 



SEO ID NO: 



SEQ ID NO: 



8870 



8871 



8872 



0.03 % 



0.03 % 



0.02 % 



0.02 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



HLA A3 - 10 mers 



Maximum possible score using this moiecuie 



Rank 



Start position 



Sequence 



SEQ ID NO: 8873 



SEQ ID NO: 



SEQ ID NO: 



8874 



8875 



SEQ ID NO: 



SEO ID NO: 



8876 



8877 



SEQ ID NO: 



SEQ ID NO: 



8878 



8879 



SEO ID NO: 



SEQ ID NO: 



8880 



8881 



% of max. score 



0.22 % 



0.18 % 



0.08 % 



0.07 % 



0.06 % 



0.04 % 



0.04 % 



0.04 % 



0.03 % 



SEO ID NO: 



SEQ ID NO: 



8882 



8883 



SEO ID NO: 



SEQ ID NO: 



8884 



8885 



SEO ID NO: 



SEQ ID NO: 



8886 



8887 



SEQ ID NO: 



SEQ ID NO: 



8888 



8889 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8890 



8891 



8892 



0.03 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



HLA A24 - 9 mers 



1.35 



12150 



Score 



27 



22.5 



10.8 



9 



4.05 



3 



- 445 - 































































































































wo 2004/092360 



PCT/US2004/011710 



Maximum possible score using this molecule 



Rank 



Start position 



196 



Seauence 



SEO ID NO: 8893 



SEQ IP NO: 8894 



% of max. score 



27.55 % 






18.78 % 



12.52 % 









12.52 % 



2.70 % 






SEQ ID NO: 



SEQ ID NO: 



8899 



8900 



1.87 % 



1.12 % 



SEQ ID NO: 



SEQ ID NO: 



8901 



8902 



0.75 % 



0.70 % 



0.64 % 



SEQ ID NO: 



SEQ ID NO: 



8903 



8904 



0.52 % 



0.52 % 



SEQ ID NO: 



SEQ ID NO: 



8905 



8906 



0.46 % 



0.45 % 



SEQ ID NO: 



SEQ ID NO: 



8907 



8908 



0.45 % 



0.45 % 



SEQ ID NO: 



SEQ ID NO: 



8909 



8910 



0.45 % 



0.45 % 



SEQ ID NO: 



SEQ ID NO: 



8911 



8912 



0.42 % 



0.37 % 



1596.672 



Score 



43.2 



30 



11.2 



10.368 



.2 



6.72 



6 



• HLA A24 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 






SEQ ID NO: 8914 



% of max. score 



7.51 % 



2.63 % 






SEQ ID NO: 8916 



SEQ ID NO: 8917 



2.25 % 



1.87 % 



SEQ ID NO: 8918 



SEO ID NO: 8919 



SEO ID NO: 8920 



0.75 % 



0.64 % 



0.63 % 



0.62 % 









0.56 % 



0.55 % 






SEQ ID NO: 8924 






0.46 % 



0.46 % 



0.45 % 



1596.672 



Score 



12 



10.368 



10.08 



10 



- 446 - 















































































































































wo 2004/092360 



PCT/US2004/011710 




SEQ ID NO: 8926 



SEQ ID NO: 8927 



SEQ ID NO: 8928 



0.45 % 



0.37 % 



0.37 % 







SEQ ID NO: 8930 



KM»H.lit»a;bkH 



SEQ ID NO: 8932 



0.35 % 



0.35 % 



0.33 % 



0.31 % 



5.6 



5.28 



5 



HLA A 0201 - 9 mers 



Maximum possible score using this molecule 



Rank I Start position 



13 



Sequence 



SEQ ID NO: 8933 



9/6 of max. score 



0.04 P/o 



3925227.1 



Score 



1793.676528 




SEQ ID NO: 8936 





SEQ ID NO: 8938 



SEQ ID NO: 8939 



SEQ ID NO: 8940 



SEQ ID NO: 8941 



SEQ ID NO: 8942 



SEQ ID NO: 8943 



SEQ ID NO: 8944 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



618.0996816 



223.23708 






198.30859992 



194.477328 



180.58536756 



169.74828 






43.42032 



41.792058 



37.46088108 



35.73520902 



20.5916435109 



12.233222865 



12.2199 



11.486706 



6.416172 



5.827464 



HLA A 0201 - 10 mers 



Maximum possible score using this molecule 




^ 



% of max. score 



0.10 % 



0.02 % 



0.02 % 



0.01 % 



3925227.1 



Score 



3977.8497792 



836.2525104 



815.616432 



660.3245145 




- 447 - 

















































































































































wo 2004/092360 



SEQ ID NO: 8957 
SEQ ID NO: 8958 



PCT/OS2004/011710 



SEQ ID NO: 8959 



SEQ ID NO: 8960 



SEQ ID NO: 8961 









SEQ ID NO: 8964 






SEQ ID NO: 8966 









H:t.H.l«.a:bl=bl 



SEQ ID NO: 8970 



SEQ ID NO: 8971 



SEQ ID NO: 8972 



0.00 % 



0.00 % 



251.837856 



159.9696 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



155.245377 



141.1974531 



117.22672269 



84.55536 



65.5671672 



64.88888616 



49.13352 



26.082 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



24.802259691 



21.5616168 






15.7068219 



12.3159135 



11.45624789925 



HLA A 1101 - 9 mers 



Maximurn possible score usinq this molecule 



Rank H Start position 



Sequence 



SEQ ID NO: 8973 



SEQ ID NO: 8974 



SEQ ID NO: 8975 



SEQ ID NO: 8976 



Vo of max. score 



12.5 % 



3.33 % 



3.33 % 



1.66 % 



36 



Score 



.5 



HLA A 1101 - 10 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 8977 



SEQ ID NO: 8978 



% of max. score 



5.55 % 



3.33 % 



36 



Score 



2 



HLA B7 - 9 mers 



Maximum possible score usinq this molecule 



Start position 



SEQ ID NO: 8979 



SEQ ID NO: 8980 



SEQ ID NO: 8981 



SEQ ID NO: 8982 



SEQ ID NO: 8983 



% of max. score 



0.74 % 



0.74 % 



0.66 % 



0.33 % 



0.22 % 



5400 



Score 



- 448 - 






























































































































SEQ ID NO: 



SEQ ID NO: 



8984 



8985 



0.22 % 



0.11 % 



SEQ ID NO: 



SEQ ID NO: 



8986 



8987 



0.07 % 



0.07 % 




SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



8988 



8989 



8990 



SEQ ID NO: 



SEQ ID NO: 



8991 



8992 



SEQ ID NO: 



SEQ ID NO: 



8993 



8994 



SEQ ID NO: 



SEQ ID NO: 



8995 



8996 



SEQ ID NO: 



SEQ ID NO: 



8997 



8998 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



HLA B7 - 10 mers 



Maximum possible score usinq this molecule 



5400 



Score 




























































































































wo 2004/092360 



PCT/US2004/011710 



Table 21: Epitopes for SEQ ID NO: 6047 



HLA A1 - 9 mers 



Maximum possible score using this molecule type 



5625 



Score 



Rank 



Start position 



53 



Sequence 



SEQ ID NO: 9019 



% of max. score 



2 % 



0.08 % 



112.5 



.5 



SEQ ID NO: 9020 



SEQ ID NO: 9021 



0.02 % 



0.00 % 



SEQ ID NO: 9022 



SEQ ID NO: 9023 



SEQ ID NO: 9024 



0.00 % 



0.00 % 



HLA A1 - 10 mers 



Maximum possible score using Jthis molecule 



5625 



Score 



Rank 



Start position 



10 



Sequence 



SEQ ID NO: 9025 



% of max. score 



0.8 % 



45 



11.25 



SEO ID NO: 9026 



SEQ ID NO: 9027 



0.2 % 



0.04 % 



SEO ID NO: 9028 



SEQ ID NO: 9029 



SEQ ID NO: 9030 



0.02 % 



0.02 % 



0.00 % 



HLA A3 - 9 mers 



Maximum possible score using this molecule 



12150 



Score 



Rank 



Start position 



Sequence 



SEQ ID NO: 9031 



°/o of max. score 



1.85 % 



SEQ ID NO: 9032 



SEQ ID NO: 9033 



0.02 % 



0.02 % 



SEO ID NO: 9034 



SEO ID NO: 9035 



0.01 % 



0.01 % 



1.8 



1.35 



SEQ ID NO: 9036 



SEQ ID NO: 9037 



0.00 % 



0.00 % 



0.675 



0.6 



HLA A3 - 10 mers 



Maximum possible score using this molecule 



12150 



Score 



Rank 



Start position 



13 



37 



Sequence 



% of max. score 



SEO ID NO: 9038 



0.04 % 



0.01 % 



6 



2.025 



SEQ ID NO: 9039 



^ 50 - 













































































































wo 2004/092360 



SEQ ID NO: 9040 
SEQ ID NO: 9041 



PCT/US2004/011710 



SEQ ID NO: 9042 



0.00 % 



0.00 % 



0.00 % 



HLA 



Maximum possible score using this 



A24 - 9 mers 



molecule 



Rank 



Start position 



20 



SEQ ID 



SEQ ID 



NO: 9043 



NO: 9044 



of max. score 



1.25 % 



SEQ ID 



SEQ ID 



SEQ ID 



NO: 9045 



NO: 9046 



0.52 % 



0.51 % 



SEQ ID 



SEQ ID 



NO: 9047 



NO: 9048 



0.36 % 



0.35 % 



SEQ ID 



SEQ ID 



NO: 9049 



NO: 9050 



NO: 9051 



0.27 % 



0.26 % 



0.21 % 



0.11 % 



■a KfcM 






SEQ ID 



SEQ ID 



SEQ ID 



NO: 9054 



NO: 9055 



NO: 9056 






SEQ ID 



SEQ ID 



SEQ ID 



NO: 9058 



NO: 9059 



SEQ ID 



SEQ ID 



NO: 9060 



NO: 9061 



NO: 9062 



0.10 % 



0.09 % 



0.09 % 



0.09 % 



0.08 % 



0.07 % 



0.06 % 



0.06 % 



0.05 % 



0.04 % 



0.04 % 



HLA A24 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 






SEQ ID NO: 9063 



SEQ ID NO: 9064 



% of max. score 



2.63 % 






0.54 % 



0.52 % 



SEQ ID NO: 9066 



SEQ ID NO: 9067 



0.50 % 



0.33 % 



SEO ID NO: 9068 



SEQ ID NO: 9069 



0.26 % 



0.21 % 



0.675 



0.54 



1596.672 



Score 



8.25 



5.76 



5.6 



3.36 



1.8 



1.68 



0.9 



0.792 



0.792 



1596.672 



Score 



42 



8.64 



3.36 



8 



26 



SEQ ID NO: 9070 



0.15 % 



2.52 



































































































































SEQ ID NO: 



SEQ ID NO: 



9072 



9073 



SEQ ID NO: 



SEQ ID NO: 



9074 



9075 



SEQ ID NO: 



SEQ ID NO: 



9076 



9077 



0.13 % 



0.12 % 



0.11 % 



0.09 % 



0.09 % 



0.09 % 



SEQ ID NO: 



SEQ ID NO: 



9078 



9079 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9080 



9081 



9082 



0.06 % 



0.04 % 



0.04 % 



0.04 % 



0.75 



0.72 



0.04 % 



0.03 % 



0.72 



0.66 



0.6 



HLA A 0201 - 9 mers 



Maximum possible score using this molecule 



Rank 







% of max. score 



SEQ ID NO: 9083 II 0.00 % 



SEQ ID NO: 9084 0.00 % 



SEQ ID NO: 9085 I 0.00 % 



3925227.1 



Score 



14.1442686 



9.598176 



9.5634 




SEQ ID NO: 9086 



SEQ ID NO: 9087 



SEQ ID NO: 9088 



SEQ ID NO: 9089 



SEQ ID NO: 9090 



SEQ ID NO: 9091 



SEQ ID NO: 9093 



SEQ ID NO: 9094 



SEQ ID NO: 9095 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 









4.88163753 






2.29699206 



2.0016040674 



0.91287 



0.71805678 



0.6694257042 



0.6539828625 



HLA A 0201 - 10 mers 



Maximum possible score using this molecule 



Rank 




Start position 



16 



19 



14 



27 



26 



34 



SEQ ID NO: 9096 



SEQ ID NO: 9097 



SEQ ID NO: 9098 



SEQ ID NO: 9099 



SEQ ID NO: 9100 



SEQ ID NO: 9101 



% of max. score 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



34.28765802 



18.9368775 



14.1442686 



11.406528 



10.9304361558 



5.580927 



- 452 - 



































































































































SEQ ID NO: 9102 



SEQ ID NO: 9103 



0.00 % 



0.00 % 



4.865742 



2.64106953 



SEQ ID NO: 9104 



SEQ ID NO: 9105 



SEQ ID NO: 9106 



0.00 % 



0.00 % 



2.6275752 



2.29699206 



0.00 % 



0.00 % 



0.86083641 



0.7049592 



0.6628440357 



0.6530644656 



EStKjjttgijiiig 



SEQ ID NO: 9108 



SEQ ID NO: 9109 



0.00 % 



0.00 % 



HLA A 1101 - 9 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



Sequence 



% of max. score 



Score 



SEQ ID NO: 9110 



SEQ ID NO: 9111 



15% 



2.22 % 



HLA A 1101 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



°/o of max. score 



36 



Score 



SEQ ID NO: 9112 



7.5 % 



HLA B7 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



35 



Sequence 



SEQ ID NO: 9113 



% of max. score 



3.70 % 



5400 



Score 



SEQ ID NO: 



SEQ ID NO: 



9114 



9115 



0.11 % 



0.07 % 



SEO ID NO: 



SEQ ID NO: 



9116 



9117 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



9118 



9119 



0.07 % 



0.03 % 



SEQ ID NO: 



SEQ ID NO: 



SEO ID NO: 



9120 



9121 



9122 



0.02 % 



0.02 % 



0.01 % 



HLA B7 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



*Vo of max. score 



5400 



Score 



5 



SEQ ID NO: 9123 



SEQ ID NO: 9124 



0.09 % 



0.07 % 



- 453 - 












































































































wo 2004/092360 



PCT/US2004/011710 



SEO ID NO: 



SEP ID NO: 



SEO ID NO: 



9125 



9126 



9127 



SEO ID NO: 



SEO ID NO: 



9128 



9129 



SEO ID NO: 



SEO ID NO: 



9130 



9131 



0.07 % 



0.07 % 



0.03 % 



0.03 % 



0.02 % 



0.02 % 



0.01 % 



Table 22: Epitopes for SEQ ID NO: 6 



HLA A1 - 9 mers 



Maximum possible score using this molecule type 



Rank 



Start position 



Sequence 



SEO ID NO: 9132 



SEO ID NO: 



SEO ID NO: 



9133 



9134 



SEQ ID NO: 



SEO ID NO: 



9135 



9136 



SEO ID NO: 



SEO ID NO: 



9137 



9138 



SEO ID NO: 



SEO ID NO: 



SEO ID NO: 



9139 



9140 



9141 






SEO ID NO: 9143 



<*/o of max. score 



0.44 % 



0.08 % 



0.04 % 



0.04 % 



0.04 % 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



HLA A1 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 



SEQ ID NO: 



9144 



9145 



SEO ID NO: 



SEQ ID NO: 



9146 



9147 



SEO ID NO; 



SEO ID NO; 



9148 



9149 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9150 



9151 



9152 



% of max. score 



0.4% 



0.12 % 



0.04 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.00 % 



0.00 % 



5625 



Score 



2.5 



2.25 



5625 



Score 



22.5 



0.9 



0.75 



0.675 



0.5 



0.5 




- 454 - 


















































































































SEQ ID NO: 9153 



SEQ ID NO: 9154 



SEQ ID NO: 9155 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



SEQ ID NO: 9156 



SEQ ID NO: 9157 



HLA A3 - 9 mers 



Maximum possible score using this moiecule 



12150 



Score 



90 



Rank 



Start position 



109 



Seouence 



SEQ ID NO: 9158 



% of max. score 



0.74 % 



SEQ ID NO: 



SEQ ID NO: 



9159 



9160 



0.24 % 



0.12 % 



SEQ ID NO: 



SEQ ID NO: 



9161 



9162 



0.07 % 



0.05 % 



6.075 



6 



SEQ ID NO: 



SEQ ID NO: 



9163 



9164 



0.04 % 



0.02 % 



SEQ ID NO: 



SEQ ID NO: 



9165 



9166 



0.02 % 



0.02 % 



SEQ ID NO: 



SEQ ID NQ: 



9167 



9168 



0.01 % 



0.01 % 




SEQ ID NO: 



SEQ ID NQ: 



9169 



9170 



0.01 % 



0.01 % 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9171 



9172 



0.01 % 



0.00 % 



0.00 % 



1.215 



0.9 



9173 



9174 



SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



0.00 % 



0.00 % 



0.00 % 



9175 



9176 



0.54 



0.54 



HLA A3 - 10 mers 



Maximum possible score using this moiecuie 



12150 



Score 



Rank 



Start position 



Sequence 



SEQ ID NO: 9177 



% of max. score 



0.49 % 



SEQ ID NO: 9178 



SEQ ID NO: 9179 



0.29 % 



0.14 % 



SEQ ID NQ: 9180 



SEQ ID NQ: 9181 



0.04 % 



0.01 % 



2.25 



1.8 



1.35 



SEQ ID NQ: 9182 



SEQ ID NQ: 9183 



0.01 % 



0.01 % 



- 455 - 



































































































































SEO ID NO: 



SEO ID NO: 



9184 



9185 



SEO ID NO: 



SEO ID NO: 



9186 



9187 



SEO ID NO: 



SEQ ID NO: 



9188 



9189 



SEO ID NO: 



SEO ID NO: 



SEQ ID NO: 



9190 



9191 



9192 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



HLA 



Maximum possible score using this 



A24 - 9 mers 



molecule 



Rank 



Start position 






SEQ ID NO: 9194 






SEQ ID 



SEQ ID 



NO: 9196 



NO: 9197 



SEQ ID 



SEQ ID 



NO: 9198 



NO: 9199 



SEQ ID 



SEQ ID 



NO: 9200 



NO: 9201 






SEQ ID 



SEQ ID 



NO: 9203 



NO: 9204 






SEO ID NO: 9206 






SEQ ID 



SEO ID 



SEO ID 



NO: 9208 



NO: 9209 



SEQ ID 



SEQ ID 



NO: 9210 



NO: 9211 



NO: 9212 



of max. score 



1.66 % 



0.77 % 



0.56 % 



0.56 % 



0.54 % 



0.54 % 



0.46 % 



0.45 % 



0.42 % 



0.41 % 



0.37 % 



0.37 % 



0.25 % 



0.25 % 



0.22 % 



0.18 % 



0.18 % 



0.18 % 



0.18 % 



0.15 % 



HLA A24 - 10 mers 



Maximum possible score using this molecule 



0.9 



0.81 



0.75 



0.6 



0.54 



1596.672 



Score 



26.6112 



12.32 



8.64 



8.64 



6.72 



Rank 



Start position 



Sequence 



% of max. score 






SE 



SEQ ID NO: 9214 



0.93 % 



0.78 % 



1596.672 



Score 



15 



12.6 



- 456 - 
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■cbToliilgliSaM 





















E 5 ^n» igr »*yKi 






0.52 % 



0.46 % 



0.45 % 



0.42 % 



0.41 % 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



0.25 % 



0.22 % 






SEQ ID NO: 9229 



SEO ID NO: 9230 






SEQ ID NO: 9232 



0.22 % 



0.18 % 



0.18 % 



0.12 % 



0.12 % 



0.11 % 



.2 



6.72 





HLA A 0201 - 9 mers 



Maximum possible score usinq this molecule tvoe 



Rank 
























idaaiJkmwi 



SEQ ID NO: 9240 












id^<ajffifckggi 









K ggn .igi^yghi 









% of max. score 



0.01 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



3925227.1 



Score 



441.342216 



63.16728165 



51.882640425 



43.8816609 



33.40165248 



24.66305226 



22.64458905 



20.70206586 



11.175953184 



8.452983 



8.1793152 



4.993461 



4.57284528 



3.999468528 



2.231322 



1.3524 



0.8631693 



^ 57 - 
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Maximum possible 



HLA A 0201 - 10 mers 



score usinq this molecule 



Rank 









% of max. score 



0.03 % 



3925227.1 



Score 






0.01 % 



0.00 % 








SEQ ID NO: 9258 



SEQ ID NO: 9259 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



1243.078056 



592.944462 



94.2678 



43.42032 



33.30332334 



32.53181778 



SEQ ID NO: 9260 



SEQ ID NO: 9261 



0.00 % 



0.00 % 



27.324 



21.3624 



13.723479 



IMj.ll.lJl.ll.-VJJJ 



IIESUii^lsBsh^ 



0.00 % 



0.00 % 






13.00344192 



10.01276388 




HLA A 1101 - 9 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 9273 



SEQ ID NO: 9274 



SEQ ID NO: 9275 



% of max. score 



3.33 % 



1.66 % 



1.66 % 



Score 



HLA A 1101 - 10 mers 



Maximum possible score usinq this molecule 




























































































































HLA B7 - 9 mers 



Maximum possible score using this molecule 



5400 



Score 



Rank 



Start position 



22 



Sequence 



% of max. score 



3.70 % 



SEO ID NO: 



SEO ID NO: 



SEO ID NO: 



SEQ ID NO: 



9277 



9278 



9279 



9280 




2.22 % 



0.22 % 



0.11 % 



0.07 % 



SEO ID NO: 



SEO ID NO: 



9281 



9282 



0.07 % 



0.07 % 



SEO ID NO: 



SEO ID NO: 



9283 



9284 



0.07 % 



0.07 % 



0.03 % 



SEO ID NO: 



SEO ID NO: 



9285 



9286 



SEO ID NO: 



SEO ID NO: 



9287 



9288 



0.03 % 



0.03 % 



SEO ID NO: 



SEO ID NO: 



9289 



9290 



0.02 % 



0.01 % 



SEO ID NO: 



SEO ID NO: 



9291 



9292 



0.01 % 



0.01 % 




SEO ID NO: 



SEO ID NO: 



9293 



9294 



0.01 % 



0.00 % 



HLA B7 - 10 mers 



Maximum possible score using this molecule 



5400 



Score 



Rank 



Start position 



46 



98 



Sequence 



SEO ID NO: 9295 



% of max. score 



1.48 % 



SEO ID NO: 



SEO ID NO: 



9296 



9297 



1.48 % 



0.37 % 



SEO ID NO: 



SEO ID NO: 



9298 



9299 



0.37 % 



0.07 % 



SEO ID NO: 



SEQ ID NO: 



9300 



9301 



0.07 % 



0.07 % 





SEQ ID NO: 



SEQ ID NO: 



9302 



9303 



0.03 % 



0.02 % 



SEQ ID NO; 



SEQ ID NO: 



9304 



9305 



0.02 % 



0.01 % 



SEO ID NO: 



SEQ ID NO: 



9306 



9307 



0.01 % 



0.01 % 



^ 59 - 
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SEO ID NO: 9332 



SEP IP NO: 9333 



SEO ID NO: 9334 



SEO ID NO: 9335 



SEO ID NO: 9336 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.9 



0.81 



0.81 



0.6 



HLA A3 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



20 



Sequence 



SEO ID NO: 9337 



SEO ID NO: 9338 



SEO ID NO: 9339 



SEO ID NO: 9340 



SEO ID NO: 9341 



SEQ ID NO: 9342 






SEO ID NO: 9344 



SEQ ID NO: 9345 



SEQ ID NO: 9346 



SEQ ID NO: 9347 



■d=l>BiaJt4Kggni 



o/o of max. score 



0.16 % 



0.09 % 



0.07 % 



0.07 % 



0.06 % 



0.03 % 



0.03 % 



0.02 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



HLA A24 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



SEQ ID NO: 9349 

































SE 



SEQ ID NO: 9360 









9A» of max. score 



18.78 % 



1.87 % 



1.50 % 



0.75 % 



0.56 % 



0.46 % 



0.37 % 



0.36 % 



0.35 % 



0.35 % 



0.30 % 



0.26 % 



0.25 % 



0.22 % 



12150 



Score 



20.25 



4.05 



4.05 



0.6 



0.54 



0.54 



1596.672 



Score 



6 



5.76 



461 - 
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SEO ID NO: 9363 



SEO ID NO: 9364 



SEO ID NO: 9365 



SEO ID NO: 9366 



SEO ID NO: 9367 



■aa>Bini?[WJ6w;M 



0.18 % 



0.18 % 



0.15 % 



0.13 % 



0.06 % 



0.05 % 



HLA A24 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



8 






SEO ID NO: 9370 





















E^ITagL»»J^yj:] 



V TT a ? [»jcW^] 



SEQ ID NO: 9380 









■d=^«njjt>ai)6?ag 



SEQ ID NO: 9384 






% of max. score 



22.54 % 



1.2.5 



0.65 % 



0.52 % 



0.45 % 



0.37 % 



0.36 % 



0.30 % 



0.28 % 



0.26 % 



0.25 % 



0.18 % 



0.18 % 



0.15 % 



0 . 10 % 



0.07 % 



0.06 % 



0.05 % 



0.05 % 



1596.672 



Score 



10.5 



8.4 



6 



5.76 



.4.8 



0.9 



0.84 



HLA A 0201 - 9 mers 



Maximum possible score using this molecuie type 



Rank 






Sequence 


















idd>naj[f»ia5gii 



% of max. score 



0.10 % 



0.03 % 



0.01 % 



0.00 % 



0.00 % 



3925227.1 



Score 



4267.988928 



1360.69088544 



569.948832 



309.0498408 



79.73570448 



51.109542 



- 462 - 
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SEQ ID NO: 9394 



SEQ ID NO: 9395 










0.00 % 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



45.25539984 



34.28765802 



26.532116325 



25.26691266 



4.72873208445 







SEQ ID NO: 9400 






SEQ ID NO: 9402 



SEQ ID NO: 9403 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



2.638538265 



2.4274552038 



1.7415324 



SEQ ID NO: 9404 



SEQ ID NO: 9405 



SEQ ID NO: 9406 



0.00 % 



0.00 % 



1.6025526 



1.453803297 



0.00 % 



0.00 % 



1.36878336 



0.824619 



0.513774 



HLAA 



Maximum possible score usinq this 



020.1 - 10 mers 



molecule 



Rank 










SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9407 



9408 



% of max. score 



0.09 % 




SEQ ID NO: 



SEQ ID NO: 



9409 



9410 



9411 



0.02 % 



0.02 % 



Score 



3636.068421648 



1107.960876 



0.00 % 



0.00 % 



836.2525104 



150.9313176 



76.55002416 




SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9412 



9413 



9414 






0.00 % 



0 . 00 ,% 



0.00 % 



0.00 % 



49.0273014 



42.1638414747 



9.29480508 



9.2669346 



SEQ ID NO: 



SEQ ID NO: 



9416 



9417 



0.00 % 



0.00 % 







SEQ ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9418 



9419 



9420 



0.00 % 



0.00 % 






0.00 % 



0.00 % 









0.00 % 



0.00 % 



5.051306761875 



2.6941464 



2.3839914 



1.465422 



0.70794 



0.6513048 



0.51882640425 




- 463 - 
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II HLA A 1101 - 10 mers 


iMaximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


% of max. score 


Score 


_1 


33 


SEQ ID NO: 9424 


1.66 % 


0.6 



HLA B7 - 9 mers 


Maximum possible score usinq this molecule type || 5400 


Rank 


Start position | Sequence 


°/o of max. score Score 


1 


13 II SEQ ID NO: 9425 


0.22 % 12 


2 


2 1 SEQ ID NO: 9426 


0.07 % 4 


3 


9 i SEQ ID NO: 9427 


0.07 % 4 


4 


16 II SEQ ID NO: 9428 


0.07 % 


4 


5 


23 II SEQ ID NO: 9429 


0.07 % 


4 


6 


5 11 SEQ ID NO: 9430 


0.02 % 


1.2 


1 7 1 15 1 SEQ ID NO: 9431 | 


0.01 % 


1 



1 HLA B7 - 10 mers HI 


iMaximum possible score using this molecule type 




T 


5400 i 


Rank 


Start position 


Sequence 


% of max. score 


1 score i 


1 


4 


SEQ ID NO: 9432 


0.07 % 


I 


4 1 


2 


10 


SEQ ID NO: 9433 


0.07 % 


I 


4 1 


3 


12 


SEQ ID NO: 9434 


0.07 % 


I 


4 1 


4 


15 


SEQ ID NO: 9435 


0.07 % 




4 1 


5 


22 


SEQ ID NO: 9436 


0.07 % 


I 


4 1 


6 


13 


SEQ ID NO: 9437 


0.02 % 


I 


1.2 1 



5 Table 24: Epitopes for SEQ ID NO: 6050 



1 HLA A1 - 9 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


47 


SEQ ID NO: 9438 


0.01 % 


0.75 


2 


21 


SEQ ID NO: 9439 


0.00 % 


0.5 


3 


53 


SEQ ID NO: 9440 


0.00 % 


0.5 



HLA A1 - 10 mers 




Maximum possible score using this molecule type 


5625 


Rank || Start position || Sequence II % of max. score 


Score 



- 464 - 















































































SEO ID NO: 9441 



SEQ ID NO: 9442 



SEO ID NO: 9443 



SEO ID NO: 9444 



SEO ID NO: 9445 



SEO ID NO: 9446 



0.04 % 



0.04 % 



0.02 % 



0.01 % 



0.00 % 



0.00 % 



HLA A3 - 9 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



54 



Sequence 



SEO ID NO: 9447 



SEO ID NO: 9448 



SEO ID NO: 9449 



% of max. score 



0.02 % 



0.01 % 



0.01 % 



12150 



Score 



HLA A3 - 10 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 9450 



SEQ ID NO: 9451 



SEQ ID NO: 9452 



% of max. score 



0.09 % 



0.01 % 



0.00 % 



12150 



Score 



12 



HLA A24 - 9 mers 



Maximum possible score usinq this molecule 



Rank 



Start position 



70 






SEQ ID NO: 9454 















SEO ID NO: 9459 



SEQ ID NO: 9460 



SEO ID NO: 9461 









SEQ ID NO: 9464 






% of max. score 



2.10 % 



1.12 % 



0.46 % 



0.37 % 



0.31 % 



0.30 % 



0.30 % 



0.25 % 



0.25 % 



0.25 % 



0.18 % 



0.06 % 



0.03 % 



1596.672 



Score 



33.6 



18 





HLA A24 - 10 mers 



- 465 - 














































































































wo 2004/092360 



PCT/US2004/011710 




Maximum possible score using this molecule 



Rank 



Start position 



14 



■■■lj.ll.UI.ll.-gH.-lll 



<Vb of max. score 



12.52 % 



1596.672 



Score 



200 



SEO ID NO: 9467 



SEO ID NO: 9468 



SEO ID NO: 9469 



SEO ID NO: 9470 



SEQ ID NO: 9471 



0.93 % 



0.75 % 



0.56 % 






0.45 % 



0.45 % 



0.37 % 



■.Hj.ll.Hl.a»gH»«ll 



SEQ ID NO: 9474 



0.35 % 



0.25 % 



■■■ij.ii.iJi.SigM.-aii 



0.12 % 



0.07 % 






SEQ ID NO: 9478 



SEQ ID NO: 9479 



0.06 % 



0.04 % 



0.75 



0.72 



Maximum possible score 



HLA A 0201 - 9 mers 



using this molecule 



Rank 







SEO ID NO: 9480 



SEQ ID NO: 9481 






% of max. score 



0.02 % 



0.00 % 



0.00 % 



SEO ID NO: 9484 



SEQ ID NO; 9485 






SEQ ID NO: 9488 



SEO ID NO: 9489 



SEQ ID NO: 9490 



SEQ ID NO: 9491 



SEQ ID NO: 9492 



0.00 % 



0.00 % 



3925227.1 



Score 



881.199 



95.013 



93.69648 



40.2894864 



0.00 % 



0.00 % 



0.00 % 



0.00 % 



O'. 00 % 



0.00 % 



0.00 % 



0.00 % 



12.6684 



10.34586 



3.3704706 



1.656 



1.47537042 



0.966 



0.942678 



0.7669695 



0.52661835 
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HLA A 1101 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



17 



Sequence 



SEQ ID NO: 9505 



% of max. score 



2.22 % 



HLA A 1101 - 10 mers 



Maximum possible score using this molecule 



Rank H Start position 



16 



Sequence 



SEQ ID NO: 9506 



% of max. score 



5.55 % 




===== HLA B7 - 9 mers 



Maximum possible score using this molecule 



Rank 




Start position 



27 



54 



70 



67 



12 



15 



19 



49 



69 



47 



5 



9 



35 



37 




Sequence 



SEQ ID NO: 9507 



SEQ ID NO: 9508 



SEQ ID NO: 9509 



SEQ ID NO: 9510 



SEQ ID NO: 9511 



SEQ ID NO: 9512 



SEQ ID NO: 9513 



SEQ ID NO: 9514 



SEQ ID NO: 9515 



SEQ ID NO: 9516 



SEQ ID NO: 9517 



SEQ ID NO: 9518 



SEQ ID NO: 9519 



SEQ ID NO: 9520 



SEQ ID NO: 9521 



% of max. score 



0.37 % 



0.22 % 



0.22 % 



0.11 % 



0.07 % 



0.07 % 



0.07 % 



0.03 % 



0.03 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



5400 



Score 



20 



12 



12 



6 
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HLA B7 - 10 mers 



Maximum possible score using this molecule 



Start position 




Sequence 



SEQ ID NO: 9522 



SEQ ID NO: 9523 



SEQ ID NO: 9524 



SEQ ID NO: 9525 



SEQ ID NO: 9526 



SEQ ID NO: 9527 



SEQ ID NO: 9528 



SEQ ID NO: 9529 



SEQ ID NO: 9530 



SEQ ID NO: 9531 



SEQ ID NO: 9532 






SEQ ID NO: 9534 



SEQ ID NO: 9535 






SEQ ID NO: 9537 



SEQ ID NO: 9538 



”/o Of max. score 



0 . 66 % 



0.22 % 



0.13 % 



0.11 % 



0.07% 



0.07 % 



0.07 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 P/o 



0.01 % 



0.00 % 



0.00 % 



0.00 % 




Table 25: Epitopes for SEQ ID NO: 6052 




HUV A1 - 9 mers 



Sequence 



SEQ ID NO: 9539 



SEQ ID NO: 9540 



SEQ ID NO: 9541 



SEQ ID NO: 9542 



SEQ ID NO: 9543 



SEQ ID NO: 9544 



SEQ ID NO: 9545 



SEQ ID NO: 9546 



SEQ ID NO: 9547 



SEQ ID NO: 9548 



SEQ ID NO: 9549 



SEQ ID NO: 9550 



°/o of max. score 



0.8 % 



0.44 % 



0.32 % 



0.17 % 



0.17 % 



0.16 % 



^ 0.11 % 



0.10 % 



0.08 % 



0.04 % 



0.04 % 



0.04 % 



5625 



Score 



45 



25 



18 



10 



10 



9 



6.25 



6 



5 



2.25 



2.25 



2.25 
































































SEO ID NO: 



SEQ ID NO: 



9551 



9552 



SEO ID NO: 



SEO ID NO: 



9553 



9554 



SEO ID NO: 



SEQ ID NO: 



9555 



9556 



SEO ID NO: 



SEQ ID NO: 



9557 



9558 



0.02 % 



0.02 % 



0.02 % 



0.02 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



1.25 



1.25 



0.9 



0.75 



0.75 



0.625 




Maximum possible score 



Rank n Start position 



HLA A1 - 10 mers 



using this molecule 



5625 



Sequence 



SEQ ID NO: 9559 



% of max. score 



1.11 % 



SEQ ID NO: 



SEO ID NO: 



9560 



9561 



0.8 % 



0.8 % 



62.5 



45 




SEQ ID NO: 



SEQ ID NO: 



9562 



9563 



0.66 % 



0.44 % 



45 



37.5 



SEQ ID NO: 



SEQ ID NO: 



9564 



9565 



0.22 % 



0.17 % 



25 



12.5 



SEQ ID NO: 



SEQ ID NO: 



9566 



9567 



0.17 % 



0.11 % 



SEQ ID NO: 



SEQ ID NO: 



SEO ID NO: 



9568 



9569 



9570 



0.11 % 



0.04 % 



6.25 



6.25 









0.04 % 



0.04 % 



0.03 % 




SEQ ID NO: 9573 



SEQ ID NO: 9574 



SEQ ID NO: 9575 



SEQ ID NO: 9576 



0.02 % 



0.01 % 



1.8 



1.125 



SEQ ID NO: 9577 



SEQ ID NO: 9578 



0.01 % 



0.01 % 



0.01 % 




HLA A3 - 9 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEO ID NO: 9579 



SEO ID NO: 9580 



SEO ID NO: 9581 



% of max. score 



0.49 % 



0.27 % 



0.16 % 



12150 



Score 



60 



33.75 



20 
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SEO ID NO; 



SEQ ID NO: 



9582 



9583 



0.09 % 



0.04 % 



SEO ID NO: 



SEO ID NO: 



SEO ID NO: 



SEO ID NO: 



9584 



9585 



9586 



9587 



0.04 % 



0.04 % 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9588 



9589 



9590 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



0.01 % 



■.Hj.ll.UI.lH.-feH 



SEQ ID NO: 



SEO ID NO: 



9592 



9593 



0.01 % 



0.00 % 



SEQ ID NO: 



SEQ ID NO: 



9594 



9595 



0.00 % 



0.00 % 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9596 



9597 



9598 



0.00 % 



0.00 % 



0.00 % 



0.00 % 




HLA A3 - 10 mers 



Maximum possible score usinq this molecule 



Rank 




Start position 



338 



Sequence 



SEO ID NO: 9599 



9/6 of max. score 



0.49 % 



SEQ ID NO: 



SEQ ID NO: 



9600 



9601 



0.32 % 



0.24 % 



SEQ ID NO: 



SEO ID NO: 



9602 



9603 



0.18 % 



0.13 % 



SEQ ID NO: 



SEQ ID NO: 



9604 



9605 



0.07 % 



0.07 % 



SEQ ID NO: 



SEQ ID NO: 



9606 



9607 



0.04 % 



0.03 % 



SEQ ID NO: 



SEQ ID NO: 



9608 



9609 



0.02 % 



0.02 % 



SEO ID NO: 



SEQ ID NO: 



9610 



9611 



0.01 % 



0.01 % 



SEQ ID NO: 



SEO ID NO: 



9612 



9613 



0.01 % 



0.01 % 



SEO ID NO: 



SEQ ID NO: 



SEQ ID NO: 



9614 



9615 



9616 



0.01 % 



0.00 % 



0.00 % 




0.9 



0.675 



12150 



Score 



30 



22.5 



4.05 



3 
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SEQ ID NO: 9617 



SEQ ID NO: 9618 



0.00 % 



0.00 % 



HLA A24 - 9 mere 



Maximum possible score using this molecule 



Rank B Start position | Sequence 



345 li SEQ ID NO: 9619 



2 B 306 B SEP ID NO: 9620 



3 B 222 II SEP ID NO: Qfi71 



111 



5 I 159 



® II 219 g SEQ ID NO: 9624 



283 



8 B 266 












SEQ ID NO: 9627 



SEQ ID NO: 9628 



SEQ ID NO: 9629 



SEQ ID NO: 9630 




SEQ ID NO: 9635 



SEQ ID NO: 9636 



SEQ ID NO: 9637 



SEQ ID NO: 9638 



% of max. score 



1.50 % 



0.75 % 



0.54 % 



0.51 % 



0.45 % 



0.45 % 



0.45 % 



0.42 % 



0.41 % 



0.37 % 



0.37 % 



0.37 % 



0.31 % 



0.31 % 



0.30 % 



0.30 % 



0.30 % 



0.30 % 



0.30 % 



0.30 % 



• HLA A24 - 10 mere 



Maximum possible score using this molecule 



Rank B Start position B Sequence 



65 B SEP ID NO: 9639 



2 B 306 B SEP ID NO: 9640 



3 II 95 B SEP ID NO: 9641 



36 



5 II 385 II SEQ ID NO: 9643 



6 II 111 « SEQ ID NO: 9644 



SEQ ID NO: 9645 



8 



9 II 221 II SEQ ID NO: 9647 



% of max. score 



0.93 % 



0.75 % 



0.66 % 



0.60 % 



0.50 % 



0.46 % 



0.45 % 



0.45 % 



0.45 % 



1596.672 



Score 



24 



12 



8.64 



8.25 



.2 



.2 



.2 



6.72 



6.6 



6 



. 6 



6 



5 



5 



4.8 



4.8 



4.8 



;4.8 



4.8 



4.8 



1596.672 



Score 



15 



12 



10.56 



9.6 



8 
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SEQ ID NO: 9649 






■ 5r= CT T >l ? PJ gg kfl 









SEO ID NO: 9654 






SEO ID NO: 9656 






SEQ ID NO: 9658 



0.45 % 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



0.37 % 



0.34 % 



0.31 % 



0.30 % 



0.30 % 



0.30 % 




Maximum possible 



HLA A 0201 - 9 mers 



score usinq this molecule 



Rank 






I.HJ.ll.lJI.H.-I.H.-bl 






SEO ID NO: 9661 









SEQ ID NO: 9664 






SEO ID NO: 9666 












SEO ID NO: 9670 















SEQ ID NO: 9675 



fcT g ar»m» T gi;w a 



SEQ ID NO: 9677 



SEQ ID NO: 9678 



<Vb of max. score 



0.03 % 



3925227.1 



Score 






0.00 % 



0.00 % 



0.00 % 



0.00 % 



50.232 



31.24872 



0.00 % 



0.00 % 



13.6235739 



11.4264 



0.00 % 



0.00 % 



8.14407 



7.0518 



0.00 % 



0.00 % 



5.112072 



3.00547233 



0.00 % 



0.00 % 



2.59578 



2.412585 



0.00 % 



0.00 % 



2.103255861 



1.86369 



0.00 % 



0.00 % 



1.646892 



1.614048 



0.00 % 



0.00 % 



0.00 % 



1.442246832 



1.319625 



1.233099 



0.966 



HLA A 0201 - 10 mers 



Maximum possible score using this molecule type 






% of max. score 



3925227.1 



Score 
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SEO ID NO: 9681 



0.00 



0.00 



0.00 



0.00 






SEQ ID NO: 9684 


















SEQ ID NO: 9690 



SEO ID NO: 9691 









SEO ID NO: 9694 












SEQ ID NO: 9698 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



0.00 



309.0498408 



98.26704 



98.26704 



36.31608 



24.0754248 



12.2199 



10.467576 



7.7274204 



4.296699 



4.286487 



4.054785 



3.883803 



3.428691903 



3.411230625 



2.99943 



2.63718 



2.48686074 



2.477311485 



2.2011 



2.16591 



HLA A 1101 - 9 mers 



Maximum possible score using this molecule 






Rank 



Start position 



Sequence 



SEQ ID NO: 9699 



SEQ ID NO: 9700 



SEQ ID NO: 9701 



SEQ ID NO: 9702 



9/6 of max. score 



16.66 % 



2.77 % 



1.66 % 



1.66 % 



Score 



6 



HLA A 1101 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



361 






Sequence 



SEQ ID NO: 9703 



SEQ ID NO: 9704 



SEO ID NO: 9705 



SEQ ID NO: 9706 



SEO ID NO: 9707 



SEO ID NO: 9708 



SEQ ID NO: 9709 



% of max. score 



16.66 % 



8.33 % 



3.33 % 



2.77 % 



2.5 % 



2.22 % 



1.66 % 



Score 



6 



^ 73 - 
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SEO ID NO: 9710 



SEQ ID NO: 9711 



SEQ ID NO: 9712 



1.66 % 



1.66 % 



1.66 % 



Maximum possible score 



Rank H Start position 



HLA B7 - 9 mers 



using this molecule 



Sequence 






SEO ID NO: 



SEQ ID NO: 



9714 



9715 



SEO ID NO: 



SEO ID NO: 



9716 



9717 



Vo of max. score 



14.81 % 



1.48 % 



0.92 % 



0.55 % 



0.37 % 



SEQ ID NO: 



SEQ ID NO; 



SEQ ID NO; 



9718 



9719 



9720 






0.22 % 



0.18 % 



SEQ ID NO: 



SEQ ID NO: 



9722 



9723 



0.14 % 



0.11 % 



SEO ID NO; 



SEQ ID NO: 



9724 



9725 



SEO ID NO: 



SEQ ID NO: 



9726 



9727 



SEO ID NO; 



SEQ ID NO; 



9728 



9729 



SEO ID NO; 



SEQ ID NO: 



SEQ ID NO; 



9730 



9731 



9732 



0.11 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



0.07 % 



HLA B7 - 10 mers 



Maximum possible score using this molecule 



Rank 



Start position 



Sequence 



SEQ ID NO: 



SEO ID NO; 



9733 



9734 



SEQ ID NO: 



SEO ID NO: 



9735 



9736 



Vo of max. score 



1.48 % 



1.48 % 



1.48 % 



SEQ ID NO: 



SEO ID NO; 



9737 



9738 



SEQ ID NO; 



SEO ID NO; 



9739 



9740 



1.11 % 



1.11 % 



0.37 % 



0.22 % 



0.22 % 



5400 



Score 



800 



5400 



Score 



80 
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r?_j 


218 I SEO ID NO: 9741 


0.22 % 


12 


10 1 


201 


SEO ID NO: 9742 


0.18 % 


10 


11 1 


121 


SEQ ID NO: 9743 


0.14 % 


8 


12 1 


391 


SEQ ID NO: 9744 


0.13 % 


7.5 


13 1 


112 


SEQ ID NO: 9745 


0.11 % 


6 


14 1 


385 


SEQ ID NO: 9746 


0.11 % 


6 


15 1 


47 


SEQ ID NO: 9747 


0.07 % 


4 


16 1 


66 


SEQ ID NO: 9748 


0.07 % 


4 


17 1 


95 


SEO ID NO: 9749 


0.07 % 


4 


18 1 


104 


SEQ ID NO: 9750 


0.07 % 


4 


19 J 


1 152 1 SEQ ID NO: 9751 


0.07 % 


4 


1 20 1 


1 158 1 


SEQ ID NO: 9752 


0.07 % 


4 
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TABLE 26: Ooned sequences for E.coU expression 



ORF 




Ooning I 


pET 


pGEltX 


P28 


537 


Ndel/Xhol 




P65 


1917 


Nhel / HindTIT 




NsplA 


2495 


Nhel/Xhol 






2153 


Ndel/Xhol 






2612 


Ndel/XhoI 




HHEliSSSflHIIIi 


431 


Ndel/Xhol 


BamED / Xhol 




426 


Ndel/Xhol 


BamHI / Xhol 




870 


Ndel/Xhol 




1 Nsp4 1 


249 


Ndel/Xhol 


BamHI / Xhol 




594 


Nhel/Xhol 




HBKkSTISHBli 


339 




BamHI /Xhol 




417 


Ndel/Xhol 


BamHI / Xhol 


1 Nsp9A 


1385 


Nhel /Xhol 






1409 


Ndel/Xhol 












Nspll 


1581 


Ndel/Xhol 




Nspl2 


1038 


Ndel/Hindin 






897 


Ndel/Xhol 




1 Spike (SI) 


1946 


NdeT/XhoI 






1598 


Ndel/Xhol 




Spike (S1-S2) 


3545 


Ndel/Xhol 




HRl 


287 


Ndel/Xhol 


BamHI /Xhol 


HR2 


146 


Ndel/Xhol 


BamHI / Xhol 


ORF3A100 


525 


Ndel/Xhol 




ORF4 


465 


Ndel/Xhol 






231 


Ndel/Xhol 


BamHI / Xhol 


1 KI!i7l?!7ir7TflTT^^ 


366 


Ndel/Xhol 


BamHI / Xhol 


ORF7A18 


137 


Ndel/Xhol 


BamHI / Xhol 


ORF8 


369 


Ndel/Xhol 


BamHI / Xhol 


ORF9 


135 


Ndel/Xhol 


BamHI / Xhol 


ORFIO 


120 


Nhel/Xhol 


BamHI / Xhol 


ORFll 


255 


Ndel/Xhol 


BamHI /Xhol 


Nudeocapsid (N) 


1269 


Ndel/EcoRI 




ORF12 


297 


Ndel/EcoRI 


BamHI /EcoKl 
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TABLE 27: Primers 



ORF 


Forward primer 


Reverse primer 


P28 


9803 


9818 


P65 


9804 


9819 


NsplA 


9805 


9820 


NsplB 


9806 


9821 


NsplC 


9807 


9822 


Nsp2 + Nsp3 


9808 


9823 


Nsp4 to Nsp7 


9809 


9824 


Nsp9A 


9810 


9825 


Nsp9B 


9811 


9826 


NsplO 


9812 


9827 


Nspll 


9813 


9828 


Nspl2-Nspl3 


9814 


9829 


ORF3-ORF4 


9815 


9830 


Env-ORFIO 


9816 


9831 


ORF11-ORF12 


9817 


9832 



TABLE 28: Primers 



ORF 


Forward primer 


Reverse primer 


Nsp2A 


SEQIDNO: 9833 


SEOIDNO; 9858 


Nsp2B 


SEOIDNO; 9834 


SEQIDNO; 9859 


Nsp3 


SEQIDNO; 9835 


SEOIDNO: 9860 


Nsp4 


SEQIDNO: 9836 


SEQIDNO; 9861 


Nsp5 


SEOIDNO: 9837 


SEOIDNO: 9862 


Nsp6 


SEQIDNO: 9838 


SEQIDNO: 9863 


Nsp7 


SEQIDNO; 9839 


SEOIDNO: 9864 


Nspl2 


SEQIDNO: 9840 


SEOIDNO: 9865 


Nspl3 


SEOIDNO: 9841 


SEQIDNO: 9866 


Spike SI 


SEOIDNO: 9842 


SEQIDNO: 9867 


Spike S2 


SEQIDNO: 9843 


SEQIDNO: 9868 


Spilffi S1-S2 


SEQIDNO: 9844 


SEQIDNO: 9869 


HRl 


SEQIDNO: 9845 


SEOIDNO; 9870 


HR2 


SEQIDNO: 9846 


SEQIDNO: 9871 


OrOAlOO 


SEOIDNO: 9847 


SEOIDNO: 9872 


Orf4 


SEQIDNO: 9848 


SEOIDNO: 9873 


EnvE 


SEQIDNO: 9849 


SEOIDNO: 9874 


Matrix MAIOO 


SEQIDNO: 9850 


SEOIDNO: 9875 


Qrf7A18 


SEQIDNO: 9851 


SEQIDNO: 9876 


Orf8 1 


SEQIDNO: 9852 


SEOIDNO: 9877 


Orf9 ^ 


SEQIDNO: 9853 


SEQIDNO: 9878 


OrflO 


SEQIDNO: 9854 


SEQIDNO: 9879 


Orfll 


SEOIDNO: 9855 


SEOIDNO: 9880 


Nucleocapsid N 


SEQIDNO: 9856 


SEQIDNO: 9881 


Orfl2 


SEOIDNO: 9857 


SEOIDNO: 9882 



- 477 - 






wo 2004/092360 



PCT/US2004/011710 



TABLE 29: Clonii^ purificatioii and ei^ression in KcoU 
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TABLE 30: E.coU expression, purity and yidd 




TABLE 31: Primers 



iHWjissni 


Rank 


Modd 


Local 




10235 


FI 


1 


1 


(106) 


10236 


F2 


2 


1 


(728) 


10237 


F3 


3 


1 


(112) 


10238 


F4 


5 


2 


(1331) 


10239 


F5 


6 


1 


(12) 


10240 


F6 


6 


1 


(346) 


10241 


F7 


8 


1 


(904) 


10242 


F8 


9 


1 




10243 


F9 


9 


1 


(1015) 


10244 


FIO 


9 


1 


(719) 


10245 


Fll 


9 


1 




10246 


F12 


10 


1 


(724) 1 


10247 


R1 


2 


1 




10248 


R2 


4 


1 




10249 


R3 


4 


1 


(758) 


10250 


R4 


5 


2 


(259) 


10251 


R5 


6 


1 


(54) 


10252 


R6 


7 


1 


(648) 


10253 


R7 


8 


1 


(948) 


10254 


R8 


8 


1 


(260) 


10255 


R9 


9 


1 


' (1282) 


10256 


RIO 


9 


1 


(950) 


10257 


Rll 


9 


1 


(756) - 


10258 


R12 


10 


1 


(132) 



A19- 
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TABLE 32: Primers 



Primers 


liist : 


(forwcucd) 












Scores 








Rank 


Model Local 


Segaence 


(Position) 


FI 


7. 


1 


SEQ ID NO: 


10352 


(290) 


P2 


7 


1 


SEQ ID NO: 


10353 


(291) 


P3 


7 


1 


SEQ ID NO; 


10354 


(294) 


F4 


7 


1 


SEQ ID NO: 


10355 


(292) 


F5 


7 


1 


SEQ ID NO: 


10356 


(293) 


F6 


9 


1 


SEQ ID NO: 


10357 


(198) 


F7 


9 


1 


SEQ ID NO: 


10358 


(199) 


F8 


10 


1 


SEQ ID NO: 


10359 


(33) 


F9 


11 


1 


SEQ ID NO: 


10360 


(200) 


PIO 


11 


1 


SEQ ID NO: 


10361 


(299) 


Fll 


12 


1 


SEQ ID NO: 


10362 


(298) 


F12 


12 


1 


SEQ ID NO; 


10363 


(297) 


F13 


14 


1 


SEQ ID NO: 


10364 


(35) 


F14 


14 


1 


SEQ ID NO: 


10365 


(34) 


F15 


16 


1 


SEQ ID NO: 


10366 


(300) 


FI 6 


17 


1 


SEQ ID NO: 


10367 


(295) 


F17 


17 


1 


SEQ ID NO: 


10368 


(296) 


F18 


17 


1 


SEQ ID NO: 


10369 


(175) 


F19 


17 


1 


SBQ ID NO: 


10370 


(36) 


F20 


20 


1 


SEQ IDJ^O; 


10371 


(202) 


F21 


20 


1 


SEQ ID NO: 


10372 


(201) 


F22 


28 


1 


SEQ ID NO; 


10373 


(204) 


F23 


28 


1 


SEQ ID NO: 


10374 


(203) 


F24 


29 


1 


SEQ ID NO: 


10375 


(269) 


F25 


29 


1 


SEQ ID NO; 


10376 


(268) 


Primers 


List 


(reverse) 








Rank 


Model Local 


Sequence 


(Position) 


R1 


7 


1 


SEQ ID NO: 


10377 


(337) 


R2 


9 


1 


SEQ ID NO: 


10378 


(229) 


R3 


11 


1 


SEQ ID NO: 


10379 


(230) 


R4 


11 


1 


SEQ ID NO: 


10380 


(338) 


R5 


12 


1 


SEQ ID NO : 


10381 


(207) 


R6 


12 


1 


SEQ ID NO: 


10382 


(338) 


R7 


13 


1 


SEQ ID NO; 


10383 


(231) 


R8 


14 


1 


SBQ ID NO: 


10384 


(80) 


R9 


14 


1 


SEQ ID NO: 


10385 


(232) 


RIO 


15 


1 


SBQ ID NO; 


10386 


(82) 


Rll 


16 


1 


SEQ ID NO: 


10387 


(340) 


R12 


17 


1 


SBQ ID NO: 


10388 


(83) 


R13 


17 


1 


SEQ ID NO: 


10389 


(206) 


R14 


17 


1 


SEQ ID NO: 


10390 


(82) 


R15 


17 


1 


SBQ ID NO: 


10391 


(337) 


R16 


18 


1 


SEQ ID NO: 


10392 


(341) 


R17 


20 


1 


SEQ ID NO: 


10393 


(340) 


R18 


20 


1 


SEQ ID NO: 


10394 


(233) 


R19 


21 


1 


SEQ ID NO: 


10395 


(79) 


R20 


22 


1 


SEQ ID NO: 


10396 


(213) 


R21 


28 


1 


SEQ ID NO: 


10397 


(236) 


R22 


29 


1 


SEQ ID NO: 


10398 


(317) 


R23 


32 


1 


SEQ ID NO: 


10399 


(391) 


R24 


35 


1 


SEQ ID NO: 


10400 


(57) 


R25 


36 


1 


SEQ ID NO: 


10401 


(237) 


Primers list Q&h part): SEQ ID NO®: 


10402-10433 


Primers list (right part): SEQ ID NO®: 10434-10464 


Primers List (forward): SEQ ID NO®' 


10465-10484 


Primers list (reverse): SEQ ID NO®: 10485-10504 
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TABLE 33: Primers 



Primers list (forward) 

Scores 



Rank 


HOdel 


Iiocal 


Sequence 


(Position) 


FI 


1 


1 


SEQ ID NO: 


10580 


(637) 


P2 


2 


1 


SEQ ID NO: 


10581 


(439) 


P3 


2 


1 


SBQ ID NO: 


10582 


(440) 


F4 


3 


1 


ID NO: 


10583 


(729) 


F5 


4 


1 


SEQ ID NO: 


10584 


(696) 


F6 


4 


1 


SEQ ID NO: 


10585 


(697) 


F7 


4 


1 


SEQ ID NO: 


10586 


(111) 


F8 


5 


1 


SEQ ID NO: 


10587 


(867) 


F9 


5 


1 


SEQ ID NO: 


10588 


(868) 


PIO 


5 


1 


SEQ ID NO: 


10589 


(869) 


Fll 


5 


1 


SEQ ID NO: 


10590 


(640) 


F12 


6 


1 


SEQ ID NO: 


10591 


(438) 


F13 


6 


1 


SEQ ID NO; 


10592 


(437) 


F14 


6 


1 


SEQ ID NO: 


10593 


(436) 


F15 


6 


1 


SEQ ID NO: 


10594 


(732) 


F16 


6 


1 


SBQ ID NO: 


10595 


(635) 


F17 


6 


1 


SBQ ID NO: 


10596 


(457) 


F18 


6 


1 


SBQ ID NO:' 


10597 


(458) 


F19 


6 


1 


SEQ ID NO: 


10598 


(636) 


F20 


7 


1 


SEQ ID NO: 


10599 


(854) 


F21 


7 


1 


SEQ ID NO: 


10600 


(855) 


F22 


7 


1 


SEQ ID NO; 


10601 


(581) 


F23 


7 


1 


SEQ ID NO: 


10602 


(853) 


F24 


7 


1 


SEQ ID NO: 


10603 


(342) 


F25 


7 


1 


SEQ ID NO: 


10604 


(343) 


F26 


7 


1 


SEQ ID NO: 


10605 


(112) 


P27 


7 


1 


SEQ ID NO: 


10606 


(94) 


F28 


7 


1 


SEQ ID NO: 


10607 


(642) 


F29 


8 


1 


SEQ ID NO: 


10608 


(638) 


F30 


8 


1 


SEQ ID NO: 


10609 


(639) 


F31 


8 


1 


SEQ ID NO: 


10610 


(730) 


F32 


8 


1 


SEQ ID NO; 


10611 


(641) 


F33 


8 


1 


SEQ ID NO: 


1Q612 


(731) 


F34 


8 


1 


SEQ ID NO: 


10613 


(326) 


F35 


8 


1 


SEQ ID NO: 


10614 


(325) 


F36 


9 


1 


SEQ ID. NO; 


10615 


(517) 


F37 


9 


1 


SEQ ID NO: 


10616 


(701) 


F38 


9 


1 


SBQ ID NO: 


10617 


(208) 


F39 


9 


1 


SEQ ID NO: 


10618 


(209) 


F40 


9 


1 


SEQ ID NO: 


10619 


(702) 


F41 


9 


1 


SEQ ID NO: 


10620 


(210) 


F42 


10 


1 


SEQ ID NO: 


10621 


(634) 


F43 


10 


1 


SEQ ID NO: 


10622 


(694) 


P44 


10 


1 


SEQ ID NO: 


10623 


(693) 


F45 


10 


1 


SEQ ID NO; 


10624 


(728) 


P46 


10 


1 


SEQ ID NO: 


10625 


(695) 


F47 


10 


1 


SEQ ID NO: 


10626 


(95) 


F48 


11 


1 


SEQ ID NO: 


10627 


(455) 


F49 


11 


1 


SEQ ID NO: 


10628 


(456) 


P50 


11 


1 


SBQ ID NO: 


10629 


(454) 
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Primers List (reTerse) 












Scores 








Rank 


Hodel 


Xiocal 


Seczaence 


(Position) 


R1 


1 


1 


SEQ ID NO: 


10630 


(367) 


R2 


1 


1 


SEQ ID NO: 


10631 


(666) 


R3 


2 


1 


SEQ ID NO: 


10632 


(464) 


R4 


3 


1 


SEQ ID NO: 


10633 


(669) 


R5 


3 


1 


SEQ ID NO: 


10634 


(750) 


R6 


4 


1 


SEQ ID NO: 


10635 


(720) 


R7 


4 


1 


SEQ ID NO: 


10636 


(465) 


R8 


4 


1 


SEQ ID NO: 


10637 


(370) 


R9 


4 


1 


SEQ ID NO: 


10638 


(668) 


RIO . 


4 


1 


SEQ ID NO: 


10639 


(135) 


Rll 


5 


1 


SEQ ID NO: 


10640 


(901) 


R12 


5 


1 


SEQ ID NO; 


10641 


(667) 


R13 


6 


1 


SEQ ID NO; 


10642 


(609) 


R14 


6 


1 


SEQ ID NO: 


10643 


(464) 


R15 


6 


1 


SEQ ID NO: 


10644 


(665) 


R16 


6 


1 


SEQ ID NO: 


10645 


(486) 


R17 


6 


1 


SEQ ID NO: 


10646 


(356) 


R18 


6 


1 


SEQ ID NO: 


10647 


(758) 


R19 


7 


1 


SEQ ID NO: 


10648 


(366) 


R20 


7 


1 


SEQ ID NO: 


10649 


(368) 


R21 


7 


1 


SEQ ID NO: 


10650 


(136) 


R22 


7 


1 


SEQ ID NO: 


10651 


(675) 


R23 


7 


1 


SEQ ID NO: 


10652 


(366) 


R24 


7 


1 


SEQ ID NO: 


10653 


(608) 


R25 . 


7 


1 


SEQ ID NO: 


10654 


(884) 


R26 


7 


1 


SEQ ID NO: 


10655 


(120) 


R27 


8 


1 


SEQ ID NO: 


10656 


(355) 


R28 


8 


1 


SEQ ID NO: 


10657 


(671) 


R29 


8 


1 


SEQ ID NO: 


10658 


(756) 


R30 


8 


1 


SEQ ID NO: 


10659 


(751) 


R31 


8 


1 


SEQ ID NO: 


10660 


(666) 


R32 


9 


1 


SEQ ID NO: 


10661 


(242) 


R33 


9 


1 


SEQ ID NO: 


10662 


(543) 


R34 . 


9 


1 


SEQ ID NO: 


10663 


(724) 


R35 


9 


1 


SEQ ID NO: 


10664 


(482) 


R36 


10 


1 


SEQ ID NO: 


10665 


(121) 


R37 


10 


1 


SEQ ID no’: 


10666 


(662) 


R38 


10 


1 


SEQ ID NO: 


10667 


(750) 


R39 


10 


1 


SEQ ID NO: 


10668 


(719) 


R40 


10 


1 


SEQ ID NO: 


10669 


(242) 


R41 


11 


1 


SEQ ID NO: 


10670 


(484) 


R42 


11 


1 


SEQ ID NO: 


10671 


(375) 


R43 


11 


1 


SEQ ID NO: 


10672 


(728) 


R44 


11 


1 


SEQ ID NO: 


10673 


(373) 


R45 


11 


1 


SEQ ID NO: 


10674 


(998) 


R46 


11 


1 


SEQ ID NO; 


10675 


(486) 


R47 


12 


1 


SEQ ID NO: 


10676 


(881) 


R48 


12 


1 


SEQ ID NO; 


10677 


(882) 


R49 


12 


1 


SEQ ID NO: 


10678 


(244) 


R50 


12 


1 


SEQ ID NO: 


10679 


(1003) 


1 ^ 


Primers list Qeft part): SEQ ID NO®: 


10680-10974 


Primers list (rigjit part): SEQ ID NO®: 10975-11282 


Primers list (forward): SEQ ID NO^: 


11283-11302 


Primers list (reverse): SEQ ID NO®: 11303-11322 



-4S2- 
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TABLE 34 



Compound 

# 


Structure 


Name 


MH+ 


1 




N-methy1-4-[(2-{[2-(1 - 
methylethy0phenyl]amino}-1 H- 
benzimidazol-5-yl)oxy]pyridine- 
2-carboxamide 


402.5 


2 


^r<Xyx/“"' 

CH. 


N-methyl-4-{[1 -methyl-2-({3- 
[(trimethylsil^ethyn]^phenyl}am 
ino)-1 H-benzimidazol-5- 
yl]oxy)pyridine-2-catboxamide 


470.6 


3 


o ^ 


N-methyl-4-[(1 -methylr2-{[2- 
(phenylcetrbonyOphenyl]amino}- 
1 H-benzimidazol-5- ' 
yl)oxylpvridine-2-cartx)xamide 


478.5 


4 


■'■oJ-kH, 

Mk) 

o" 


4-(methyloxy)-N-I6-(methyloxy)- 

,3-benzothiazol-2-yl]-3- 

nitrobenzamide 


360.4 

i 


5 


I 


M{2-[(4-birtylphenyl)amino]-1 ,3- 

benzothiazol-5-yl}oxy)-N- 

methytpyridine-2-carboxamide 


433.5 


6 


Q 

H.C 


N-methyl-4-({1 -methyl-2-[(6- 
pynx)lidin-1 -ylpyridin-3-yl)amino]- 
1 H-benzimidazol-5- 
/l}oxv)pvridine-2-carboxani ide 


444.5 


7 




.1 '-bi(cyck)hexyl)-2- 
/lamino]-1 -me1hyl-1 H- 
Penzlmidazol-5'yl}oxy)-N- 
methylpyridine-2-carboxamide 


462.6 


8 


H,C 


H{2-[(4-chlorophenyl)amino]-1- 
Tiethj^-1 H-b6nzimidazo^5- 
/l}oxy)-N-1 ,3-thiazol-2- 
/lpyridine-2-carboxamide 


477.9 



u 



^ 3 - 
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<H( 1 -methyl- 2 -{I 2 - 
(methyloxy)phenyI]amino}-1 H- 
t>enzimidazol-5-yi)oxy]-N-[3- 
(methyloxy)propyi]pyridine- 2 - 
carboxamide 



1 If 4-({2-[(4-ethylphenyl)amino]-1,3- 
“ benzoxazol-5-yl}oxy)-N- 

methylpvridine- 2 -caiboxcimide 



l-[(3-fluorophenyl)carbonyl]-4- 367.4 
[4- 

trlfluoromethyl)phenyl]methyl}pi 

terazine 



1 -[2-(ethyloxy)phenyl]-4-{[3,4,5- 
'CH 3 tris(methyloxy)phenyi]carbonyl}p 






1 -(3-chlorophenyl)-4-{[2- 
(ethyloxy)phenyl]carbonyl}pipera 



3-({4-[(2E)-3-phenylprop-2- 

n anyl]piperazin-1-yl}carbonyl)-7- 

° oxabicyclo[ 2 . 2 . 1 ]heptane- 2 - 

carboxylic acid 

UssAn'CH, 1-[2-(methyloxy)phenyI]-4- 

QJ T o {13.4,5- 

^ 'CH^ tris(methyloxy)phenyQcarbonyl)p 

iperazine 



[(4-pyridin-2-ylpiperazin-1 - 
4)caibonyl>7- 
xabicycio[ 2 . 2 . 1 ]heptane- 2 - 
box^ic acid 
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en 20 [b 1 oxanthrene- 6 . 1 1 -dione 



thyl 6,11-dioxo-6,11- 
ihydrobenzo[b]phenazine-2- 



l,N-dimethyl-9,10-dioxo-9,10- 
ihydroanthracene-1 - 
ulfonamide 



-(trifluoromelhyl)-3-{[3,4.5- 

is(methyloxy)phenyOcarbonyl}n 

"htho[2.3-bTfuran-4.9-dione 



2-{2-oxopropyl)-2-phenyl-1 H- 
ndene-1 ,3(2H)-dione 



thyl 4-{5-[(3- 445.4 

itrophenyl)carbonyl]-1 ,3-dioxo- 
,3-dihydro-2H-isoindol-2- 
Dbenzoate 



5,6-dichlorD-2-[2-chloro-5- 
(trifluoromethyl)phenyl]-1 H- 
so’mdole-1 ,3(2H)-dione 



bromo-4-{[(2- 

uorophenyl)methyl]oxy}-5- 

methyloxy)benzaldehyde 

iosemicarbazone 
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i-(4-2iinino-2-oxo-1 -propyl-1 ,2- 
ihydroquinolin-3-yl)-l H- 
enzimidazole-&carbonitrile 




N— ([ / ^ 



6 ~ 6 



4-amino-6-fluoro-7-({[4- 
(methyloxy)phenyqinethyl}amino 528.6 
-3-[5-(4-methy1piperazin-1 -yl)- 
I H-benzimidazol-2-yQquinolin- 
1H)-one 



k^loro-3-(5-chloro-1 H- 
>en2lmidazol-2-yl)-4-{[2- 
dimethylamino)ethyl]amino}quin 
lin-2(1H)-one 










N^NH 



.ja 



5,5-dimethyl-4-methylidene-3- 
(2,4,6-trinilrophenyl)-1 ,3- 
oxazolidin-2-one 



5-methyl-2-[4- 

(methyloxy)phenyl]hexahydro- 
1 H-isoindole-1 .3(2H)-dione 



-methyl-2-(4- 

)ethylphenyl)hexahydro-1 H- 
oindole-1 ,3(2H)-dione 



1-2— (4-chlorophenyl)-6,6- 
imelhyl-1 ,6-dihydro-1 ,3,5- 
iazine-2.4-diamine 
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CH,o^O 

; 



(7Z)-7-(furan-2-ylmethylidene)-3 
3henyl-3,4-clihydro-2H- 
1 ,3]thiazolo[3,2-a][1 ,3.5]triazin- 
5f7H)-one 



3aR,9R,9aR)-6,7-dihydroxy-9- 
3,4,5-tris(methyloxy)phenyl]- 
)a, 4,9,9a- 

etrahydronaphtho[2,3-c]furan- 
3H)-one 



6-chloro-2-(ethyloxy)-4-methyl-3- 387.8 
(4-nltrophenyl)-3M.9.9a- 
etrahydro-3H-pyrrolo[2,3- 
uinoxaline 



thyl 2-(ethylo)cy)-4-methyl- 
a,4,9,9a-tetrahydro-3H- 
yirolo[2,3-b]quinoxaline-3- 
arbox^ate 











hyl4-({[2,5- 






is(methytoxy)phenyl]amino}met 
yl)-3,5-dimethyl-1 H-pyrrole-2- 



I -{3-[(6-amino-5-nftropyridin-2- 
l)amino]propyl}-4-(2- 
hlorophenyl)-N-[(2S)-2- 
iydroxypropy1]-1 H-pyrrole-3- 
rboxamide 










2S,5R)-N~1— (4-methylphenyl)- 429.5 

■phenyl-N~2— (2-pyridin-2- 
ethyl)pyrrolidine-1 ,2- 
icarboxamide 
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4(3S)-3-(acetylamino)-2- 
xopyrrolidin-1 -yl]-N-I2-(4- 
uorophenvOethvllacetaniide 



[2-(2,4-dichloropheny0ethyI]- 

({(Z)-[(4,4- 

ifluorocyclohexyl)imino][(3S)-3- 

iethylpiperazin-1- 

methyl}amino)benzamide 



[4-(methyloxy)phenyl]-5- 

henylisoxazole 






ethyl 4-{[4-(1 -methylethyl)-2,3- 421 .4 

ioxo-7-(trffluoromethyl)-3,4- 
ihydroquinoxalin-1 (2H)- 
InethyDbenzoate 



(3beta,16beta)-3,14,16- 

trihydroxybufa-20,22-dienolide 




-(aminomethyl)-l -(2-pyridin-2- 
ethy0quinazolin-4(1 H)-one 



thyl 4-{[5-[3.4- 
is(methyloxy)pheny1]-7- 
trifluorometh^)pyrazolo[1 ,5- 
i]pyrimidin-3- 
i]carbonyl}piperazjne-1 - 
arboxylate 



[3,4-bis(methyloxy)pheny0-3* 
iperidin-1 -ylcarbonyO-7- 
(trifluoromethyl)pyrazolo[1 ,5- 
'“lyrimidine 






- 490 - 
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[3,4-bis(methyloxy)phenyl]-N- 
ethyl-N-(2-pyridin-2-yIethyl)-7- 
trifliioromethyl)pyrazoIo[1 ,5- 
imidine-2-carboxamide 



3)inj 




»-propyl-2-thien-2- 
Ipyrazolon ,5-alpyrimldin-7-ol 
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H-N y f 



H3C^'CH3 



HO, 



ho^V?TH 

OH 






CH, 



Table 35 



Source 



Aventis Pasteur 





iRoche Btosdence 



Uterature Reference I Patent Number 



1) Lang, J.-M.;Touraine, 
J.-L;Trepo. C. et aL 
Lancet 1988, 2(6613): 
702 - 5 , 



Dong, M.K.etal. 

Phannacologist 1988, 

30(3): Abst 87.8. ES TO02792 



Mizuno, O. et al. 4th !nt 
Ckmf Immunopharmacd 
(May 15-19, Osaka) 

1988, Abst WS6-3. EP236929 




1) Iwami, M. et aJ. J 
Antiblot 1987, 40(5): 612 
22. JP 87161796 





US 4725622 



US 4727069 



1) Nishikaku, F. and 
Koga,Y.4thlntConf 
Imntunophannacol (May 
15-19, Osaka) 1988, 

Abst WS6-8 . EP 248399 



492 
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19 




Novart^ 




FR 2604177 


20 




ADiR 




AU 8811669 


21 


Cxy«^ 


Pharmacy 




AU 8810908 


22 


O OM 

O 


Avenfis Phanna 




EP 284461 


23 


k^NH, 
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BRIEF DESCRIFnON OF SEQUENCE LISTING 






roNO: 



1 




21-520 



521-1020 



1021-3520 



3521-6020 



6021-6026 



6027-6033 



6034^8 



6039-6051 



6052 



6053-6056 



6057-6060 



6061-6064 
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6069-6071 
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Draft genome assembly from The Genome Science Cento* in British Colombia, 
Canada of sequence from TOR2 isolate. WR2j!raft_jienomjJssemMv 120403 Release 1 



CPC SARS-CoV strain sequence. Entire nucleotide sequence (Urbani strain) 
Group-specific coronavirus gene products 

> Feline infectious peritonitis virus (FDPV) 

3/4 = ORF 3b; 5/6 = ORF 3X; 7/8 = ORF 3A 

> Canine coronavirus 

9/10 = ORF 7b; 1 1/12 = ORF 7a 

> Avian infectious bronchitis virus 

13/14 = ORF 5b; 15/16 = ORF 5a; 17/18 = ORF 3a; 19/20 = ORF 3b 
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ID NO: 6039 
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AmiiK) acids 879-1005 of SEQ ID NO: 6042 
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NadA-based polypeptides 
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Orflab polYDrotein 
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uences for SEQ ID NOS 7233-7244 
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Zinc binding region 2 site 



N-glycosyladon sites in SEQ ID NOS: 6040-41,6043,6045-46,6050-51 
Polypeptides and polynucleotides 



Intcrgenic sequences 

Nucleotides firom 5' end of SARSV genome followed by intergcnic seqn 
NadA constructs 



I Fragments of SEO ID NO: 6042 



NadA 



NadA leader 



Amino acid sequencess from NadA 



PCRprimas 



Primal 



(X!ACC sequence 



3* UTR forward 



3* UTR revCTse primears 



3* UTR probes 



5' UTR forward 



5 UTR reverse primers 



y UTR probes 

Conserved octanuclcotide • 

^Rcvofsc complement o f SEQ ID NO: 7293 
Intergenic seauence 



PolyT 



Stem-loop sequence 



cine linkers 






Nucleocapsid epitope site 



Antisense 



Probes 



Antigenic fragments of SEQ ID NO: 6042 



T-epitope analysis of SEQ ID NO: 6039 



T-epitope analysis of SEQ ID NO: 6040 



T-epitope analysis of SEQ ID NO: 6041 



T-epitope analysis of SEQ ID NO: 6042 



T-epitope analysis of SEQ ID NO: 6043 



T-epitope analysis of S 



ID NO: 6044 



T-cpitope analysis of SEQ ID NO: 6045 



ID NO: 6046 



T-epitope analysis of SEQ ID NO: 6047 



I T-epitope analysis of SEQ ID NO: 6048 



T-epitope analysis of SEQ ID NO: 6050 



ID NO: 6052 



Primes for spike protein amplificatiop, particularly fra 



lation sites within SEQ ID NO: 6039 



Cleavage products for ORFlab (Table 10 



T-epitope analysis of 
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T-epitope analysis of SEQ ID NO: 6049 
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9780-9782 


Forward prima, reverse primer, probe 


9783-9784 


Lysine-rich region 


9785-9798 


Oligonucleotides used for S.cerevisiae expression 


9799-9802 


Sequ^ices fromHgures 65 & 66 


9803-9882 


Primes for Kcoli cloning 


9883-9885 


BCV nucleotide seouences for Hgures 3A, 3B. 3C 


9886-9891 


BCV amino acid sequences for Figures 4A, 4B, 4C, 4D. 4K 4F 


9892 


BCVS'UTR ^ 


9893 


BCV3’UTR ■ 
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MHV nucleotide sequences for Rgures 3A, 3B. 3C 
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METV amino acid sequences for Rgures 4A, 4B, 4C, 4D, 4R 4F 
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AIBV nucleotide sequences for Figures 3A, 3B 


9905-9909 


AIBV amino acid sequences for Hgures 4A, 4B, 4D, 4E. 4F 


9910 


ATOV5*UTR 


9911 


AIBV 3’UTR 


9912-9913 


HOBMPRO, HOBBDEGA nucleotide sequences for Figures 3B 3C 


9914-9918 


Human CoV amino acid sequences for Hgures 4A. 4B. 4C. 4E. 4F 


9919 


HCoV-OC43 5'UTR 


9920 


HCOV-OC43 3’UTR — 


9921-9923 




9924-9926 


Codon-optimised N, M and E sequmices 
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^NI-1 : 


9928-9959 


Constitumit amino acid sequences >4aa inferred fix>mSEO ID NO* 9997 


9960 


ORFlab variant 
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ORFla variant 




Spike variant 


9963 


Membrane variant 


9964 


Nucleocapsid variant 


9965-9966 


Short ORFs 
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FRA complete genome 
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CLAIMS 

1. An isolated polypqptide of the SARS vims. 

2. The polypeptide of claim 1 , wherein the polypeptide is a Spike (S) polypeptide, an Env 
(B) polypeptide, a Membrane (M) polypeptide, a hemagglutinin-esterase polypeptide CHE), a 

5 nucleocq>sid (N) polypeptide, a ORFla polypeptide, a ORFlab polypeptide, a proteolytic 
fragment of a ORFla polypeptide, or a proteolytic fragment of a ORFlab polypeptide. 

3. The polypeptide of claim 1 , wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO": 6039, 7232, 9766, 9767, 9768, 9769, 9770, 
9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 6042, 6043, 6044, 6045, 6046, 6047, 

10 6048, 6049, 6050 or 6052. 

4. The polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence 
having >75% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO**: 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 
9769, 9770, 9771, 9772, 9773, 9774, 9775, 9776, ^77, 9778, 9779, 9997, 9998, 10149, 10316, 

15 10338, 10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 

10577. 10578. 10579. 11561. 11562. 11618. 11619. 11620. 11627. 11630. 11633 & 11636. 

5. The polypeptide of claim 1, wherein the polypeptide comprises a fragment of at least 10 
consecutive amin o acids of an amin o acid sequence selected from the group consisting of SEQ 
ID NO®: 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 

20 9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338. 

10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 

10578. 10579. 11552. 11561. 11562. 11618. 11619. 11620. 11627. 11630. 11633 & 11636. 

6. A polypeptide comprising an amin o acid sequence having >80% sequence identity to an 
amin o add sequence selected from the group consisting of SEQ ID NO®: 6042, 6043, 6044, 

25 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 9770, 9771, 9772, 9773, 

9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316. 10338, 10339, 10340, 10341, 
10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 10578, 10579, 11552, 

11561. 11562. 11618. 11619. 11620. 11627. 11630. 11633 & 11636. 

7. A polypeptide comprising an amin o acid sequence that comprises a fragment of at least 

30 10 consecutive amino acids of an amino add sequence selected from the group consisting SEQ 

ID NO®: 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 
9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 
10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 

10578. 10579. 11552. 11561. 11562. 11618. 11619. 11620. 11627. 11630. 11633 & 11636. 
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8. A polypeptide compiising an amino acid sequence having >80% sequence identity to 
SEQ ID NO: 6042, and/or comprising an amino acid sequence that comprises a fragment of at 
least 10 consecutive amino acids of SEQ ID NO: 6042, wherein the polypeptide is in the form of 
atrimer. 

5 9. Nucleic acid encoding the polypeptide of any one of claims 1 to 8. 

10. Nucleic acid according to claim 9, comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO®: 7191, 7273, 7275, 7277, 7279, 7281, 7283, 7285, 7287, 7289, 
7291, 7292, 7293, 9968, 10066, 10084, 10299, 10505, 11323, 11563, 11639 & 11640. 

11. A polynucleotide comprising a nucleotide sequence having >80% sequence identity to 
10 the nucleic acid of claim 9 or claim 10. 

12. A polynucleotide comprising a fragment of at least 10 consecutive nucleotides of the 
nucleic add of claim 9 or cl aim 10. 

13. Antibody that recognizes tihe polypeptide of any one of claim 1 to 8. 

14. The antibody of c laim 13, wherein said antibody recognizes the polypeptide comprising 
15 the amin o add sequence of SEQ ID NO: 6042 or a fragment thereof. 

15. The antibody of claim 14, wherein said antibody recognizes the polypeptide comprising 
the amin o add sequence of SEQ ID NO: 6042 or a fragment thereof in trimeric form. 

16. The antibody of claim 13, wherein the antibody is a monoclonal antibody, 

17. The antibody of claim 13, wherdn the. antibody is a human antibody, 

20 18. An immunoassay for detecting a SARS virus antigen in a sample, comprising the step of 

contacting the sample with the antibody of any one of claims 13 to 17. 

19. An immunoassay for detecting an antibody against a SARS virus antigen in a sample, 
comprising the step of contacting the sample with the polypeptide of any one of claims 1 to 8. 

20. A method of detecting an antibody against a SARS virus antigen in a sample comprising 
25 contacting said sample with the polypeptide of any one of claims 1 to 8, under conditions 

suitable for binding said polypeptide to said antibody, if present, and detecting the binding of 
said polypeptide to said antibody. 

21. A method for detecting a SARS virus antigen in a sample comprising contacting said 
sample with the antibody of any one of claims 13 to 17, under conditions suitable for binding 

30 said antibody to said anti^n, if present, and detecting the binding of said antibody to said 

antigen. 
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22. A vaccine for the treatment or prevention of sevrae acute respiratory syndrome (SARS), 
comprising an inactivated SARS virus, a killed SARS virus, an attenuated SARS virus, a split 
SARS virus preparation, or at least one purified SARS virus antigens. 

23. Ihe vaccine of claim 22, comprising a purified polypeptide according to any one of 
5 claims 1 to 8. 

24. The vaccine of claim 22 or claim 23, wherein the antigen is a purified SARS virus 
antigen in the form of a VLP. 

25. The vaccine of any one of claims 22 to 24, further comprising an adjuvant 

26. The vaccine of claim 25, wherein the adjuvant is an aluminium salt or is MF59. 

10 27. The vaccine of any one of claims 22 to 26, comprising more than one SARS virus 

antigen. - 

28. The vaccine of claim 27, wherein the antigens are selected from S, E, N and M. 

29. The vaccine of claim 22, comptising an inactivated SARS virus. 

30. The vaccine of claim 29, wherein said virus is inactivated by chemical or physical means. 

15 31. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 

an effective amount of one or more of the following agents selected fiem the group consisting of 
detergents, formaldehyde, formalin, P-propriolactone, and IJV light. 

32. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the following agents selected from the group consisting of 

20 methylene blue, psoralen and carboxyfuUerene (C60). 

33. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the following agents selected from the group consisting of 
binary ethylamine, acetyl ethyleneimine and gamma irradiation. 

34. The vaccine of claim 31, wherein said-inactivation comprises treatment with P- 
25 propriolactone. 

35. The vaccine of claim 34, wherein said p-propriolactone is used at a concentration of 0.01 
to 0.5%. 

36. Hie vaccine of claim 34, wherein said P-propriolactone is used at a concentration of 0.5 
to 0.2%. 

10 37. The vaccine of claim 34, wherein said P-propriolactone is used at a concentration of 

0.025 to 0.1%. 
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38. A method of inactivating SARS virus comprising exposing the virus to an inactivation 
agent for 12 to 24 hours at refrigeration temperatures followed hydrolysis of any lesidual 
inactivating agent by elevating the temperature for three hours. 

39. The method of claim 38, wh^in the inactivation agent is p-prdpriolactone. 

5 40. The method of claim 38, wherein the refrigeration temperature is between 0"C and 8®C. 

41. The method of claim 38, wherein the elevated temperature is between 33®C and 41®C. 

42. A method for making an inactivated SARS vaccine comprising: 

. a innoculating a mammalian cell culture with SARS virus; 
b. cultivating the infected cells; 

10 c. harvesting SARS virus containing supernatant; 

d. inactivating the SARS virus; and 

e. purifying the inactivided SARS vims. 

43. The method of claim 42, wherein said manunalian cell culture is derived from one or 
more of the ceD types selected from the group consisting of fibroblast, cells, endothelial cells, 

15 hepatocytes, keratinocytes, immune cells, mammary cells, smooth muscle cells, melanocyte 
cells, neural cells, prostate cells, renal cells, skeletal cells, liver cells, retinoblast cells and 
stromal cells. 

44. Ihe method of claim 42, wherein said mammalian cell culture is derived from a cell 
culture selected from the group consisting of hiunan cells, non-human primate cells, HeLa cells, 

iO human diploid cells, fetal rhesus lung cells, hmnan embryonic kidney cells, VERO cells, horse 
cells, cow cells, sheep cells, dog cells, cat cells or rodent cells. 

45. The method of claim 42, wherein said mammalian cell culture is derived from VERO 
cells or fetal rhesus kidney cells. 

46. The method of claim 42, wherein said mammalian cells are cultured in serum free 

15 47. The method of claim 42, wherein said mammalian cells are cultured in protein free 

media. 

48. The method of claim 42, wherein said inoculating step comprising absorbing the SARS 
virus onto the cell culture for 60 to 300 minutes. 

49. The mediod of claim 42, wherein said inoculating step is conducted at 25®C to 40^. 

•0 50. The method of claim 42, wherein said purification step comprises one or more of the 

treatments selected from the group consisting of gradient centrifugation, ultracentrifugation, 
continuous-flow ultracentrifugation, chromatography, polyethylene glycol precipitation, and 
ammonium sulfate precipitation. 
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51. Tlie method of claim 42, wherein said puiification step comprises one or more of the 
treatments selected firom the group c onsisting of ultrafiltiation and dialfiltration. 

52. The method of claim 50, wherein said chromatography treatment includes one or more of 
die chromatography treatments selected from the group consisting of ion exchange 

5 chromatography, size exclusion chromatography, and liquid affini ty chromatography. 

53. The method of claim 52, whaein said chromatography treatment i ncl udes use of one 
more chromatographic resins selected from die group consisting of an an anionic resin and a 
cationic resin. 

54. The method of claim 52, wherein the ion exchange chromatography treatment includes a • 
10 first step using a strong anion exchange resin and a second step using a strong cation exchange 

resin. 

55. The method of claim 50, wherein said gradient centrifugation purification step comprises 
density gradient centrifugation. 

56. Ihe method of claim 42, wherein said purification step comprises a first step of 

15 chromatography purification and a second step of gradient centrifugation. 

j, . . . . 

57. The method of claim 56, wherein said first chromatography purification step comprises 
liquid affinity chromatography. 

58. The method of claim 56, wherein said second gradient centrifugation step comprises 
density gradient centrifugation. 

20 59. A single-stranded oligonucleotide comprising a nucleotide sequence selected ficom the 

group consisting of SEQ ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 
7332-7352, 7353-7385, 10235-10298, 10352-10504, 10580-11322 and 11325-11551. 

60. A single-stranded oligonucleotide comprising the complement of the oligonucleotide of 
claim 59. 

25 61. The oligonucleotide of claim 59 OT claim 60, comprising 10-30 nucleotides. 

62. The oligonucleotide of claim 61, comprising the nucleotide sequence of SEQ ID NO: 

7292, SEQ ID NO: 7293, Ae complement of SEQ ID NO: 7292 or Ae complement of SEQ ID 
NO: 7293. 

63. A kit conoprising primers for amplifying a template sequence contained within a SARS 
30 virus nucleic acid target, Ae kit comprising a first primer and a second primer, wherein Ae first 

primer comprises a sequence substantially complementary to a portion of said tftm platft sequence 
and Ae second primer comprises a sequence substantially complementary to a portion of Ae 
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complement of said tenq>late sequence, wherein the sequences within said primers which have 
substantia] conq)lementarity define the tennini of the template sequence to be amplified- 

64. The kit of claim 63, wherein the template sequence is contained within SEQ ID NO: 1 
and/or SEQ ID NO: 2. 

5 65. The kit of claim 63 or claim 64, wherein the first primer comprises a fragment of 8 or 

more nucleotides of SEQ ID NO: 1, and the second prims’ comprises a fragment of 8 or more 
nucleotides of the complement of SEQ ID NO: 1. 

66. The kit of claim 63 or claim 64, wherein the first prims' comprises a fragment of 8 or 
more nucleotides of SEQ ID NO: 2, and the second prims comprises a fragment of 8 or more 

10 nucleotides of the complement of SEQ ID NO: 2. 

67. The kit of claim 63, wherein the first prims is an oligonucleotide according to any one of 
claims 59 to 62 and the second primer is an oligonucleotide according to any of claims 59 to 62. 

68. The kit of any one of claims 63 to 67, furths comprising a labeled probe that comprises 

either a fragment of 8 or more nucleotides of SEQ ID NO: 1 and/or SEQ ID NO: 2, or the- 
15 complement of said fragment, which fragment is located within the templatp. sequence. . ' 

69. , The kit of any one of claims 63 to 68, wherein the first primer and/or the second primer 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NOS: 21-6020, 
6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352, 7353-7385, 10235-10298, 10352- 
10504, 10580-1 1322 and 11325-11551. 

50 70. The kit of any one of claims 63 to 68, wh^in the first primer ^d/or the second primer 

comprises the complement of a nucleotide sequence selected from the group consisting, of SEQ 
ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352, 7353-7385, 
10235-10298, 10352-10504, 10580-11322 and 11325-11551. 

71. A method of detecting the presence of SARS virus in a sample comprising providing a 

.5 sample suspected of containing a SARS virus nucleic acid target, amplifying a template sequence 
contained within said SARS virus nucleic acid target with the kit of any one of claims 63 to 70, 
and detecting the amplified tenq>late sequence, wherein the presence of the amplified template 
sequence indicates the presence of SARS virus in said sample. 

72. The method of claim 71, wherein said amplifying is accomplished using polymerase 
0 chain reaction, transcription mediated amplification, reverse transcription PCR, ligase chain 

reaction, strand displacement amplification or nucleic acid sequence-based amplification. 

73. A double-stranded RNA molecule with a length fium about 10 to about 30 nucleotides 
which is able to inactivate the SARS coronavirus in a mammaliim cell. 
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74. The double-stranded RNA of claim 73, wherein the sequence of one of the strands is at 
least 90% identical to a tar^t sequence, wherein the tar^t sequence is a fragment of SEQ ID 
NO: 1 and/or SEQ ID NO: 2. 

75. The double-stranded RNA of cl aim 73 or c laim 74, wherein the target sequence 

5 comprising a nucleotide sequence selected from the group consisting of SEQ ID NOS : 7292, 

7293, 7294, 7295, 7296, 7297, 7298, 7299, 7300 and 7301. 

t 

76. Tlie double-stranded RNA of any one of cl aim.*; 73 to 75, comprising at least one 
modified nucleotide. 

77. . A method for treating a patient suffering from SARS, comprising: administmng to the 
10 patient a dierapeutically effective dose of a molecule of less than 1000 g/mol. 

78. The method of claim 77, wherein the molecule has an aromatic region and greater than 
one -heteroatom selected from O, S, or N. 

79. A method for treating a patient suffering from SARS, comprising: administering to the 
patient a therapeutically effective dose of a compound selected from: a nucleoside analog, a 

15 peptoid, an oligopeptide, a polypeptide a protease inhibitor, a 3C-like protease inhibitor , a 
papain-like prptease inhibitor, or an inhibitor of an RNA dependent RNA polymerase. 

80. A method for treating a patient suffering from SARS, comprising: administering to the 
patient a steroidal anti-inflammatory drug in combination with at least one antiviral compound. 

81 . A method for treating a patient suffering from SARS, comprising: admini.stftring to the 
20 patient a therapeutically effective dose of a compound selected from: acyclovir, gancyclovir, 

vidarabidine, foscamet, cidofovir, amantidine, ribavirin, trifluorothymidine, zidovudine^ 
didanosine, zalcitabine, an antiviral compound listed in Table 1; an antiviral compound listed in 
Table 2; or an interfM'on. 

82. The method of claim 81, wh^in the interferon is an interferon-a or an interferon-p. 

25 83. The method of any one of claims 77 to 82, wherein the molecule or compound is 

delivered by inhalation. 

84. A metiiod of idmtifying a th^apeutically active agent comprising the steps of: (a) 
contacting a therapeutically active agent with a cell infected with the SARS virus; (b) measuring 
attmmation of a SARS related enzyme. 

10 85. A viral vector or particle for in vivo delivery of a nucleic acid of claim 9 or c laim 10. 

86. The viral vector of claim 85, wherein the vector is an adenovirus vector, a poxviras 
vector or an alphavirus vector. 

87. An alphavirus replicon particle comprising one or more SARS viral antigens. 
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88. The leplicon particle of claim 87, wherein said SARS viral antigen is a spike protein. 

89. The replicon particle of claim 87, whraein said particle comprises a replicon derived firom 
Venezuelan Equine Bicephalitis (VEE) and further comprises an envelope derived from Sindbus 
virus (SIN) or S emlilci Forest Virus (SFV). 

5 90. A vaccine comprising one or more SARS virus antigens and one or more respiratory 

virus antigens. 

91. The vaccine of claim 90, wherein said respiratory virus antigens are selected from the 
group consisting of influCTza virus, human rhinovirus OHRV), parainfluenza virus (PIV), 
respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and rhinovirus. 

10 92. The vaccine of claim 91, wherein said respiratory virus antigen is from influenza virus. 

93. The vaccine of claiin 90, wherein said respiratory virus antigen is fiom a coronavkus 
other than the SARS virus. 

94. A polypeptide comprising an momimogenic, surface exposed fragment of the amino acid 
sequence SEQ ID NO: 6042. 

15 95. The polypeptide of claim 94, wherein said fragment does not include the last 50 amino 

acids of the C-terminus of SEQ ID NO: 6042. 

96. The polypeptide of claim 94, wherein said fragment does not include a transdomain 
region of SEQ ID NO: 6042. 

97. The polypeptide of claim 94, wherein said fiagment does not include a C-tenninus 

► > 

20 cytoplasmic domain of SEQ ID NO: 6042. 

98. The polypeptide of claim 94, wherein said fiagment does not include a N-terminus signal 
sequence. 

99. An isolated polynucleotide conq>rising a nucleic acid sequence selected fium the group 
consisting of SEQ ID NOS: 9968 and 10066. 

25 100. The polynucleotide of claim 99, wherein the polynucleotide comprising a nucleic acid 

sequence having > 80% sequmce identity to a polynucleotide sequence selected from the group 
consisting of SEQ ID NOS: 9968 and 10066. 

101. An isolated polynucleotide comprising a fragment of at least 15 consecutive nucleic acids 
of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 9968 and 10066 

30 and wherein said fiagment does not consist entirely of SEQ ID NO: 10033. 

102. An isolated polypeptide comprising an amino acid sequence encoded by any one of 
claims 99-101. 
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103. TTie polypeptide of claim 102, comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 9969 - 1(K)32, 10067, and 10015. 

104. The polypeptide of claim 103, wherein the amino acid sequence is selected from the 
group consisting of SEQ ID NOS: 9997, 9998 and 10015. 

5 105. An expression construct for recombinant expression of a SARS virus spike protein 

wherein said construct comprises a nucleic add sequence selected fiom the group consisting of 
SEQ ID NOS: 6578 - 6583. 

106. A mammalian cell line stably expressing a SARS viral antigen. 

107. Ibe cell line of claim 106, wherein said cell line is a Chinese Hamster Ovary (CHO) cell. 

10 108. The cell line of claim 106, wherein the SARS viral antigen is a spike protein or fragment 

thereof. 

109. The cell line of claim 106, wherein the spike protein is truncated to remove the 
transmembrane sequence. 

110. A method of identifying a therapeutically active agent comprising the steps of: (a) 

15 contacting a therapeutically active agent with a buffer comprising SARS enzyme; and (b) 

measuring attenuation of the SARS enzyme. 

111. The method of claim 110 wherein the SARS enzyme is a SARS protease. 

112. The method of claim 111 wherein the buffer further comprises a peptide with a SARS 
protease cleave site. 

20 113. The method of claim 1 10 wherein the measurement is made by the measurement of 

fluorescence. 

1 14. A vaccine of one of claims 22 to 37, and 90 to 93 further comprising an adjuvant. 

115. The vaccine of claim 114 wherein the adjuvant is a SMIP. 

116. The vaccine of claim 115 wherein the SMIP compound is selected from the group 
25 consisting of an acylpiperazine, a tryptanthrin, an indoledione, a tetrahydroisoquinoline, a 

benzocyclodione, an amino azavinyl compound, a thiosemicarbazone, a lactam, an 
aminobe nzimi dazole quinolinone, a hydropthalamide, a benzophenone, an isoxazole, a sterol, a 
quinazolinone, a pyrole, an anthraquinone, a quinoxaUne, a triazine, an benzazole, and a 
pyrazolopyrimidine, or a pharmaceutically acceptable salt, ester, or prodrug thereof. 

30 117. A method of vaccinating a subject cortqmsing administering a vaccine of one of daiirig 

22 to 37, and 90 to 93. 

118. TTie method of claim 117 further comprising administ^ng a SMIP. 
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119. A method for treating a patient of one of claims 77 to 82 furtb^ conqirising - . 
administming at least one SMIP compound. 

120. A mediod for treating a patient of one of claims 77 to 82 further comprising 
administedng at least one SMIS compound. 
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FIGURE 3C (contd.) 
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FIGURE 3C (contd.) 
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FIGURE 4A 
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FIGURE 4A (contd.) 
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FIGURE 4A (contd.) 
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FIGURE 4A (contd.) 
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FIGURE 4B (contd.) 
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FIGURE 4C (contd.) 
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FIGURE 4F (contd.) 
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SEQ 


ID 


NO: 6069 


SEQ 
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00:6042 


SEQ 


ID 


NO: 6072 



SEQ 


ID 


NO: 6053 


SEQ 


ID 


NO:6057 


SEQ 


ID 


NO: 6061 


SEQ 


ID 


NO:6065 


SEQ 


ID 


NO: 6069 


SEQ 


ID 


00:6042 



MPVLLVAYALLH 12 

MKLFmVVMPLIYGiajPPCSKLamTIGMQmiBTPIIiNySSRIiPPNSDVV^ 60 

MRSLIYFWLI^PVLPTLSLPQDmCQSTTNFRRFPSKFN ^VQAPAVWLGG 52 

imiimSIiPTAPAVIGDIiKCTIVS-INIWiyr^ 38 

MLFVFILFIJSOiGyiGDFRCIQLVNSNGANVSAPSIS 40 

HFIFLLFLTLTSGSDLDRCTTFDnVQAPNYTQHTSSMRG 39 

MLWPIJjLVTU^ALCSAVLTOSSS 27 



YFmQPWFNCIBNDSNDLYVTLENIiKALYVroYATENITWNMQRIiNVVVNGYPYSITVT 120 

YLPSMNSSSWYCGT6IBTAS6VH6IFLSYIDSGQGFEIGIEQEP FDPSGYQLYLH 107 

TDTVDVTNGMTYYVm — YY 69 

THWSQGLGTYYVIJDRVYliNATLLLTG YY 71 

VYTPDKIPRSDTLYLTQDLFLPFYSNVTGFHTIUHTFG — HP 79 

^ YVYYYQSAFRPPSGWHLQG-- 46 



lAGCQTTN GLNTSYSVCKG 31 

TTRraSAEGAIICICKGSPPTTTTESSLTCNWGSECRLNHKFPICPSNSEANCGNMLYG 180 
KATNGNTNAIARIJlICQPPimTLGPTVNDVTTORNCIiFmiPAYMRDGKDIW 167 
PTSGSTYBNMALKGTLLLSTIMKPPPI.SDraGIFAKVKM'mKIXSVMSEFPAra 129 
PVIXSSKPiajIJUliTGTNSVSLSWPQPPYLSQFNDGIFAKVQNLKTSTPSGATAYFPTIVIG 13 1 
VXPFKDGIYFAATEKSNmGHVFGSTHEnSDKQSVIIIKNSTlimRAai^^ 139 
GAYAVVNISSEFNNAGSSSGCTVGIIHGGRVVNASSIAMTAP---- 88 



LQWFADEVVAYLHGASYRISFBNQWSGTVTFGDMRATTLEVAGTLVDLWWFNPVYDVSYY 240 

lTORVTVFADKIYHFYLKiraSRVATRCYlC«lSC7\MQYVYT 214 

STFVOTSYSVVVQPHTTILGNKLQGFIiEISVCQYTMCEYPNT 171 

SLFGYTSYTWIEPYN GVIMASVCQYTICLLPYT 165 

VS KEMGTQOmHIFIfflaFH 158 



(WSENVFAVESGGYIPSDFAFNNWFIiL'PNTSSVVDGVVRSF 74 

RVNNKNGTTWSNCTIXKy^SYVANVPTTQPGGFIPSDFSFNNWFIiLmSTLVSGKLVTK 300 
WSAGEIXSIYYEPCTANCTGYAANVFATDSNGHIPEGFSFNNWFIiIiSNDSTIiLHGKVVSN 274 
lOJPN-LGNQRVEIitffiWDTGWSaiYKRNFmWKADYLYFHFYQEGGTPYAYPTD^^ 230 
DCKPNTNGmiGFWHTIWPIC\aiKRNFTLNVNAI]AFYFHPYQHGGTFYAYYADK^^ 225 
-CTFEYISDAFSIiin^ERS6NPKUijREFVPKNlUX3PLYVYK6YQFIDWRDIiPSGPNTIiK 217 
SSGMAWSSSQFCTAH(a}FSDTTVFVTHCYKHGG--CPLTGa!!I^ 131 



QPLLIJraMSVSGLRFmPVYPNGTGRGIXaCGPSSDVLSCVIRYlOIPEBN^ — T 131 
QPIiWaMPVPSFEEAASTFCPEGAGFDQOTGAVIiNNWDVIRFNriNFT^ 360 

QPU;jVNCIJJ^PmGIiGQFPSFNHTlSmGVCNGAAVDRAPEAIJlFNIHDTSVILAEG--S 332 

TKFIJTJVYLGTVLSHYYVMPLTCN SALTLEYWVTPLTSKQYIiAFNQDGVIFNAVD 286 

TTFIJSVYIGDILTQYrVLPFICNPTAGSTPAPRYWVTPLVKRQYLFNFNQKGVITSAVD 285 
PIPiOfI£ZinXI!FRAZI.TAFSPAQDIW6TSAAAYFVGYI£PTTEm£3nSNGTIT^^ 277 
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LIRVSMKNGQIJiraLWSVaKYPTFRSFQCVIiNLTSVYIiNGDLVYTSNETIDVTSAGVY 191 



IIiFKTSYG VVVFYCTNNT-LVSGDAHIPFGTVIyGNFYCFVNTTIGNETTSAFVGAL 186 

WSLNTTGGVTLEISCTfWSDS-SFFSYGEIPFGVTDGPRYCifVHy NGTALKYLGTL 416 

IVLETALG mSFVCSNSSDPHLMFMPIGATEVPYYCFLKVDTYNSTVYKFLAVL 389 

OCSDFMSBIKCKTIiSIAPST6VYELNGYTVQPIM)VYmPNLPDCN-IEAS«LNDKSVPS 345 
OVSSYTSBIKCKTQSMLPSTGVYELSGYTVQPVGVVYRRVMttiPACN-IEEWLTARSVPS 344 
CS(^Ii&EUCCSVKSFEIDK6IYQTS]!lFinnn?S6DVVR-FFIirniIiCPF(SVFII&TKFPS 336 
FKAGGPITYKVMREVKAMYFVireTAQmiiCDGSPRGLLACQYlJTGNFSDGFYPFTNSS 251 



PKTVREFVISRTGHFYINGYRYFTIiGNVEAVNFNVT TAETTDFCTVALASYADVLV 242 

PPSVKEIAISKWGHFYIN6YNFFSTFPIDCISFNLT TGDSDVFWTIAYTSYTEALV 472 

PPTVREmTKYGDVYVJrcFGyLHLGlJjimVTIKFTGHGTDDDVSGFWTIASTNFVDALI 449 

PIJJWBRKTFSNCNFNMSSI^FIQAYSFTaNNIDAA KIYGMCFSSITIDKFAIPNG 401 

PLNWERKTFQlCNPNLSSIiIJflYVQAESLFCM^ KVYGRCFGSISVDKFAVPRS 400 

VXAMERKKISBCVA£YSVLYBSTFFSTFKCYGVSAT RUlDIiCFSlSVYADSFVVKGD 392 

LVKQKFIVYRENSV!m?CrriiHNFIFHNETGANPNPSG--VQNIQTYQTKTAQSGYYNFNF 309 



WSQTSIANIIYC2JSVINRLRCr)QLSFDVPr)6FYSTSP--IQSVELPVSIVSLFVYHKHT 300 
QVENTAITKmaiSHVNNIKCSQITANLNNGFYPVSS--SBVGLVNKSVVIiLPSFYTHT 530 
EVQGTSIQRILYCDDPVSQIiKCSQVAFDIiDDGFYPISSRNLLSHEOPISFVTLPSPNDHS 509 
•RKVDLQLGNL6YLQSFNYRIDTTATSCQLYYMLPAANVS— VSRFNPSTWNRRFGFTEQS 459 
RQVDLQLGNSGFLQTAiram)TAATSCQLHYTLPKNHVT--niNHNPSSWNRRYGFNDAG 458 

EVRQIAPGQT6VIJU]YIIXK6P0DF1«K:VL&W^ STGHYNYORYIiREG 446 

SFLSOTVYKESNFMYGSYHPSCKFRLETINNGLWFNSLS VSIAYGPLQGGCKQS 363. 



FIVLYVDFKPQ-SGGGKCFNCYPA6VNITLANFNET KGPLCVDTSHFTTKYVAVYAN 356 

IVNITIGLGMKRSGYGQPIASTIjS--NITLPMQDHN TDWCIRSDQFS-VYVHSTCK 584 

FVNITVS AAFGGLSSANLVASDTTINGFSS FCVDTRQFTITLFYNVTN 557 

VFKPQPAGVFTDHIAWYAQHCFKASTNPCP(3aJ>3SLCVGNGP6IDAGYKTSGIGTCPAG 519 

VFGKN QHDWYAQQCFTVRSSYCPC 483 

KLRPFER 453 

VFKGRAT 370 



VGRWS ASIOTGNCPFSFGKVNNFVKFGSVCFSLKDIPGGCAMPIVA 402 

SAlM)NIFKRNCTIMiDATAVIKTGTCPFSFDKIiNmTFNKFCIiSLSPVGANCKFDVAA 644 

SY6YVS KSQDSNCPFTLQSVNDYLSFSKFCVSTSLLAGACTIDLFG 603 

TNYLTCBNAAQOXrLCTPDPITSKATGPYKCPQTKYLVGIGEHCSGLAIKSDHCG G 575 

AQPDIVSPCTT QTKPKSAFVNVGDHCBGLGVLEDNCGNADPH 525 

DISlSIVPFSPIXaCPCTPPAIiJICYWPIililD 480 

CCYAYSYGGPSLCKGVYSGELDHNFECGL 399 



NWAYSKY YTIG SLYVSWSDGDGITGVPQPVEGVSSFMNVTLDKC 446 

-RTRTNE QWR SLYVIYEEGlPn^/GWSDNSGVHDIiSVIiHL^ 687 

YPAFGSG VKLT SLYFQFTKGELITGTPKPLEGITDVSFICnjDVC 647 

NPCTCQP QAPI/MSVDSOjQGDRCMFANFIIiHDVNSGTTCSTDIiQKSNTDIILGVC 632 

KGCICAN NSFI®»SHDTCLVNDRCQIFANIIiIiNGINSGTTCSTDLQLPNTEWTGIC 582 

YSFYTTTGIGy QPYBVVVLSPEIiIJJAPATWMPKLSTDLIKHQC 524 

LVYVTKS GGSRIQTATEPPVITQNNYHNITLNTC 433 
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TKYNIYDVSGVGVIRVSlIDTFIiN GITYTSTSGNLLGFKDVTKGTIYSITPC 

TDYNIYGRT6VGIIRQTNRTLLS GLYYTSLSGDLLGFKNVSDGVIYSVTPC 

TKYTIYGFKGEGIITLTNSSILA GVYYTSDSGQLIAFKNVTSGAVYSVTPC 

VNYDLYGITGQGIFVEVMATYYNS WQNLLYDSNGNLYGFRDYLTNRTFMIRSC 

VKYDLYGITGQGVFKBVKADYYNS WQTLLYDWGNLNGFIffiLTTNKTYTIRSC 

VKFHFNGLTGTGVLTPSSKRFQP FQQFGRDVSDPTOSVBDPKTSKILDISPC 

VIlYmGRTGQ6PITN\m)SAVSYHYIJfflAGLMH)TSGSIDIFWQGEYGIiNYYKVNPC 



HPPDQLWYQQA--VVGAMIiSENFTSYGFSN\/VELPKFFYASNGTYN 

DVSAQAAVIIX3T--IVGAITSINSEIIiGLTHWTTTPNFYYYSIYiraM)RTRGTAIDSND 

SFSEQAAYVNDD— IVGVISSLSNS— TFNNTRELPGPFYHSNDGSN 

YSGRVSAAFHAN--SSEPALLFRNIKOJYVFNim.SRQLQPINYFDSY]:iGCVVNAmSTS 
YSGRVSAAFHKD--APEPALLYRraCSYVFSNNISREENPLbreFDSYIGCVV^^ 
AFGGVSVITPgimSSgVAVLYOD raC TDyST A imfflOlTPJOlRlYgKa^^ 
BDVNQQFWSGGK-LVGILTSRNETGSQLLENQFYIKITNGTRRFRRSITEN 



-CTDAVLTYSSFGVCADGS IIAVQPRHVSYDSVSAIVTAN LSI 

VDCBPVITYSNIGVCKMGA FVFDJVTHSDGD-VQPISTGN VTI 

-CTEPVLVYaUGVCKSGS IGYVPSQYGQVK-IAPTVTGN ISI 

SWQTCTLTVGSGYCWSTKRRSRRSITTGYRFTNFEPFTVNSVNDSLEPVGGLYEIQI 

EALPNCDLRMGAGLCVDYSKSRRADRSVSTGYRLTTFEPYTPMLVNDSVQSVDGLYEMQI 

IGAEEVI)TSYECn)IPIGA6ICASmVSIIASTSQKSIVAYTHSIiGAI)SSIAYSin»Tm 

VANCPYVSYGKFCIKPDGS lATIVPKQLEQFVAPLFNVTEN VLI 

. * 



PSNWTTSVQVEYLQITSTPIVVrcsmQJGm/RCVEIJIiKQYTSAaOTIEDALRNSARLE 

PTNFTISVQVEYIQVYTTPVSIIX:SRYVCMlM>R(m[J.TQYVSACQTIEQAIAMGARIiE 

PTNFSMSmEYLQLYNTPVSVDCATYVCNGNSRCKQIjLTQYTAACKTIESAIiQLSARIiE 

PSEFTIGNMEEFIQTSSPKVTIDCSAFVCGDYAACKSQLVEYGSFCramiAILTEVNELL 

PTNFTIGHHEEPIQmPK\rPIIXaAFVCOTroAaiQQLVBYGSFCVNVmLNEVNNLL 

PTOFSISITCEVHPVSH&inSVlXIHUnCSDSlSCAinJiIiQYGSFC^ 

PNSPNLTVTOEYIQTRMDKVQINCLQYVCGSSLDaRKIiFQQYGPVCDNILSVVNSVGQKE 



SADVSEMLTFDKKAFTIANVSSFGDYN LSSVIPSLPTSGS 

lOJEVDSMIJVSENALKLASVEAFNSSBTLDPIYKEWPNIGGSWLEGLKYILPSHNS 

SVEVNSMLTISEEALQLATISSFNGDG YNFTNVLGASVYDPASGR 

»TTQI<3VMSLMGVTIiSTKLKDGVNFN ^VDDINFSPVLGCIiGSDCN 

DNMQLQVASALMQGVTISSRLPDGISGP IDDDJFSPLLGCIGSTCAED6N 

DRBTREVFAQVKQIlXKrFTLKYFGGFH FSQILPDPLKPTK 

MJELLNFYSSTKPAGFNTPVLSNVSTG EFNISLLLTNPSSRRK 



RVAGRSAIEDILFSKLVTSGL6TVDADYKKCTKGLS- -lADLACAQYYNGIMVLPGV 

IOanroSAIEDLLFDKmSGIGTVDEDYKRCTGGYD--IADLVCAQYYMGIMVIiPGV 

WQKRSVIEDIJ.PNKV\/TNGL6TVDEDYKRCSNGRS--VADLVCAQYYSGVMVLPGV 

KVSSRSAIEDLLPSKVKICTVG-FVEAYNHCTGGAE--IRDLICVQSYMGIKVI 1 PPL 

GPSAIRGRSAIEDLIJTDKVKLSDVG-FVEAYNNCTOGQE--VRDIiI.CVQSFNGimPPV 
RSFIEDiIJflIKm<An&G-EmgY6B(n:rf^ -ARm>ICAQKni6LWLPPL 

RSLIEDLIJTSVESVGLP-TiroAYKNCTAGPLGFFKDIiACAREYNGLLVLPPI 

** . * * ***..*. . 



497 

738 

698 

685 

635 

576 

493 



542 

796 

741 

743 

693 

636 

544 



584 

838 

782 

803 

753 

696 

588 



644 

898 

842 

863 

813 

756 

648 



684 

954 

887 

909 

863 

796 

691 



739 

1009 

942 

963 

920 

846 

743 
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SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID 110:6042 
SEQ ID NO: 6072 



SEQ ID NG:6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID 110:6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
• SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID !K>:6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID -NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



ADAERMAMYTGSLIGGIALGGLTS AVSIPFSIMQMUiNYVMKJTDVLQENQKIIA 795 

ANADKimiYTASIiAGGITLGMiGGG AVMPPAVAVQAIttjgyVMiQTDVIinOIQQI^ 1066 

VDAEKLHMYSASLIGGMALGGITA AAALPFSYAVQARIMLALQTDVLQRNQQLIiA 998 

LSENQIS6YTLAATSASLPPPWSA AAGVPFYUJVQYW3JG1GVTMDVLSQNQKLIA 1019 

LSESQISGYTTGATAAAMPPPWSA AAGVPFSLSVQYRINGLGWTMNVLSENQKm 976 

LTDEHIAAYTAALVSGmAGffTFGAGAAIiQIPFAlIQHAntroGIGVTQNVIiYENQRQIA 906 
ITAMQALYTSSLVASMAFGGITA AGAIPFATQLQARINHLGITQSLLIjKNQEKIA 799 



ASFNIOMTNIVDAFTGVNDAITQTSQAI.QTVATAIJlKIQD\/VNQQGNSIiNHLTSQLRQNF 855 
SAFNQAI6ST[TQSFGKVNDAIHQTSRGMTVAKAIJ«0/QDVVNIQGQALSHLTVQI.QNNF 1126 
ESPNSAIGNITSAFESVKKAISQTSK6IOTVAHALTKVQEWNSQGSAUJQLTVQLQHNF 1058 



NAFNNALGAIQE6FDATN SALVKIQAVVNANAEAUJNLLQQLSHRP 1065 

SAFNNALGAIQD6PDATN-. SALGKIQSWNANAEADINLWQLSNRF 1022 

NQFNKAISQIQESI/ITTS -^-TAI£KIiQDVVNQNAQAIJITLVKQLSSNF 952 

ASFNKAIGHMQBGFRSTS LALQQIQDWSKQSAILTEIMASLNKNF 845 



QAISSSIQAIYDMIJTIQADQQVDRLITGRLAAIJJVFVSHTLTKyTEVRASRQLAQQKVN 915 
QAISSSISDIYNRLDELSADAQVDRLITGRLTALNAFVSQTLTRQABVRASRQLAKDKVN 1186 
QAISSSIDDIYSRLDILSADVQVDRLITGRLSALNAFVAQTLTKYTEVQASRKLAQQKVN 1118 
GAISSSLQEILSRLDALEAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAMEKVN 1125 
GATSASLQEILTRLEAVEAKAQIDRLTNGRLTALNAYISKQLSDSTLIKVSAAQAIEKVN 1082 
6&ISSVLmiLSiaanCVEAEVQinRLITGRI.QSLQ!mi!QQLIRAAEIRASANLAATKIIS 1012 
GAISSVIQEIYQQFDAIQANAQVDRLITGRLSSLSVLASAKQAEYIRVSQQRELATQKIN 905 

iri'ir** ic » * m • ic . it ^ icirk-k -kic 1c • it » • km 



ECVKSQSKRYGFCG-NGTHIFSIVNAAPECaiVFIJm/LLPTQyKDVEAWSGIiCVDG 970 

BCVRSQSQRFGFCX5-NGTHIJSLMIAAPNGMIFFHmLPTAYBTVTAWP6ICASDG-DR 1244 

BCVKSQSQRYGFCGGDGEHIFSLVQAAPQGLLFLHTVLVPGDFVNVLAIAGLCVNG 1174 

ECVKSQSSRINFCG-NGNHIISLVQNAPYGLYPIHFSYVPTKYVTAKVSPGLCIAG 1180 

BCVKSQTTRINFCG-NGNHILSLVQNAPYGLYFIHFSYVPISFTTANVSPGIiCISG 1137 

ECVLGQSKKVDFCG-KGlHLHSFPQAAPEGVVFLHmVPSQEBNFTTAPAICHEG- 1067 

ECVKSQSIRYSFCG-NGRHVLTIPQNAPNGIVFIHFSYTPDSPVNVTAIVGFCVKPANAS 964 

kkk km k kkk ^k ★ . . . • . ** *« *-* * ^ , 1 * 



TNGYVLRQPNLALYK EaiYYRITSRIMFEPRIPTMADFVQIENCaJVTFVNISRS 1024 

TFGLWKDVQLTLFRN LDDKFYLTPRTMYQPRVATSSDFVQIEGCDVLFVNATVS 1299 

EIALTIJffiPGLVIJTHELQTYTATEYFVSSRRMFEPRKPTVSDFVQIESCVVTYVNLTSD 1234 

DRGIAPKSGYFVNVNN TWMFTGSGYYYPEPITGNNWVMSTCAVNYTKAPDV 1232 

DRGLAPKAGYFVQDDG EWKFTGSSYYYPEPITDKNSVIMSSCAVNYTKAPEV 1189 

-KAYFPREGVFVPNOT SnFITQRNFFSPQIITTDNTFVSGNCDWIGIINNT 1118 

QYAIVPANGRGIFIQVN GSYYITARIMYMPRAITAGDWTLTSCQANYVSVNKT 1018 

• • • k k m k 



ELQTIVP-EYIDVNKTIiQELSYKL-PNYTVPDLVVEQYNQTIIiNLTSEISTLENKSAEtN 1082 
DLPSIIP-DYIDINQTVQDILENFRPNWTVPELTFDIFNATYIiNLTGEIDDLEPRSEKLH 1358 
QLPDVntP-DYIDVNKTLDEILASL-PNRTGPSLPLDVPNATYLNLTGEIADLEQRSESLR 1292 



lffjnSTP-NIiPYFKEEI^WPKNQTSVAPDI.SLDY--INVTFLDLQDEMN 1279 

FLNTSIP-NPPDPKBELDKWPKNQTSIAPDLSIDFEKIJJVTLIiDLTYEMN 1238 

VYDPLQP-ELDSFKEELDKYFKNHTSPDVDLGDISG-INASWNIQKEID 1166 

VlTTPVraiDDPDPNDBLSKWWND!EKHELPDPDKFN--YTVPILDIDSEID 1066 
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SEQ ID NO: 6053 
SEQ ID HO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO:6069 
SEQ ID NO: 6042 
SEQ ID NO: 6072 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID NO:6042 
SEQ ID HO: 6072 



SEQ ID NO: 6054 
SEQ ID NO: 6062 
■SEQ ID NO: 6058. 
SEQ ZD NO: 6045 
SEQ ID NO: 6073 
SEQ ID. NO: 6066 



SEQ ID NO: 6054 
SEQ ID HO: 6062 
SEQ ID NO: 6058 
SEQ ID NO: 6045 
SEQ ID NO: 6073 
SEQ ID NO: 6066 



SEQ ID HO: 6055 
SEQ ID NO: 6063 
SEQ ID NO: 6059 
SEQ ID HO: 6067 
SEQ ID NO: 6070 
SEQ ID NO:6046 
SEQ ID HO: 6074 



SEQ ID HO: 6055 
SEQ ID NO: 6063 
SEQ ID NO: 6059 
SEQ ID HO: 6067 
SEQ ID NO: 6070 
SEQ ID NO:6046 
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YTVQKLQTLIDNINSTLVDIjKWlJmVETYIKWPWWVWLCISVVLIFWSMI^^ 1142 

NTTVEmLIIMINNTLVHIiEWIjNRIETYVKWIWYVWIiLIGLVVIFCIPIiI^^ 1418 

NTTEEIJlSLINNIHNTLVDI^nNRVEniKWPWWmilVlVLIFWSLLVFCCISTGC 1352 

K6QEAIKVMQSyiNLKDIGTYEYYVKWI>WYVWHiIGFAGVAMLVII.FFICCC^ 1335 

RIQDAimJJESmiiKEVGiraimWPWyVWiLIGIAGVAVCVLI^ 1294 

RLNEVAKNUlESLZDIiQEL6KXEQYIKNFNZVNL6FIA(3iIAIVHVTIL^^ 1222 

RIQGVIQGm)SLIDIiEKLSII.miIOTPWYVWLAIAFATIIFILILGWVPPOT^ 1122 

. ,-k . ..t. . . it o-kicirk •ie'k-k ... * t 

• • •••••••••»• • • • • • 



CGFFSCFA SSIRGCCESTKLPYYD-VEKIHIQ 1173 

CGCIGCLG SCCHSICSRRQFENYEPIEKVHVH 1450 

CGCCGCCG ACFSGCCRGPRLQPYEAFKKVHVQ 1384 

G— TSCFK KCGGCCDDyrGHQELVIKT SHED- 1364 

G— SCCFK KCOICCDByGGHQDSIVIHNISSHED- 1326 

CSCLKGAC SC6-SCCKFDEDDSEFVLR6VXLEIT- 1255 



CGC(XX3CFGIMPUISKC»KKSSYYTTFDNDVVTEQYRPKKSV 1164 



FIGURE 7B 

-MFLKLVDDHA-LVVNVLLVOVLmLLVCITIIKLIKI^FTaMFOIRTVY 51 

MLQLVHiaro-LVVNVILWIiFVI<FFIJiIISITPVQLVNI/:FTaiRIiafS^^ 54 

^MTFPRALTVIDDNG-MVnOTIFWFIJ:.IIILIII.SIAIiLNIIKLaiVCC^^ 59 

HISFVSEETmiVHSVIIflAFVVFLLVTIAII/lAIi^^ 52 

MNLIMSI.EENG-SFLTALYIIVGFIJ^YIJiGRAI.QAFVQAADACCLFWYTWW 57 

MFMADAYFADTVWYVGQIIFIVAiajLVIIVWAFLATFKIiClQLCGMCNTLVL 54 



GP IKNVYH-IY-QSYMH IDPF PKRVIDF 77 

TP IGRLYR-VY-KSYMR IDPL PSTVIDV- 80 

VP AQHAYD-AY-KNFMR IKAYN PD6ALLA 86 

KP TVYVYS-RV-KNLBS SEGV PDLLV- 76 

IPGAKGTAFVYIOT^GIUajJNPELEAVIVNEFPKNGWNNKNPANFQDAQRDKLYS 112 

SP SIYVFN-RG-RQFYEF YHDVKP PVLDVDDV 84 

* : : . . * 



FIGURE 7C 



MSNDNC TGDIVTHLKNWNF 19 

MSNGSIP- ^VDEVIEHERHWNF 24 

MIII,IIACVIA(y^ERYC7aiKSIXIT»LSaaJSTASIX:ESCFNGGDIJVrai^^ 60 

MSSVT-TPAPVYTWT ADEAIKFLKEWNF 27 

MSSTTQAPEPVYQWT ADEAVQFLKEWNF 33 

HADNSTIT— VEELKQUEQWNL 21 

MPNBTNCTLD FEQSVQLFKEYHL 28 

k ^ k . 



GWNVILTIFIVn<}FGHYKYSRLPYGIiKMLVLmiWPLVIJU:.SIFDTWANWDSN-WAFVA 78 
TWNIILTILLWIiQYGHYKYSVFLYGVKMAILWIUJPLVLALSLFDAWASFQVN-WVFFA 83 
SWSIIIJVPITVLQYGRPQFSWFVYGIKMLIMWLLWPVVLALTIFNAYSEYQVSRYVMFG 120 
SIGIILLPITVILQFGYTSRSMPVYVIKMVILWIJ(IWPLTIILTIFN--CVYAIiN-NVYlG 84 
SIGIILLFITIILQF6YTSRSMPIYVVKMIILWIiIWPLTIVLCIFN--CVyAIiN-NVYLG 90 
VIGFIfIAim!IiIiQFATSNRHmYIIKLVFLNUA?PmACFVIiA--AVYRin-nVT66 78 
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SEQ ID NO: 6074 



SEQ ID HO: 6055 
SEQ ID NO: 6063 
SEQ ID NO: 6059 
SEQ ID NO: 6067 
SEQ ID NO: 6070 
SEQ ID NO: 6046 
SEQ ID NO: 6074 



SEQ ID NO: 6055 
SEQ ID NO: 6063 
SEQ ID NO: 6059 
SEQ ID NO: 6067 
SEQ ID NO: 6070 
SEQ ID BO: 6046 
SEQ ID NO: 6074 



S^ ID NO: 6055. 
SEQ ID NO: 6063 
SEQ ID NO: 6059 
SEQ ID NO: 6067 
SEQ ID NO: 6070 
SEQ ID NO: 6046 
SEQ ID NO: 6074 



SEQ ID NO: 6056 
SEQ ID NO: 6064 
SEQ ID NO: 6060 
SEQ ID NO: 6068 
SEQ ID NO: 6071 
SEQ ID NO:6051 
SEQ ID NO: 6075 



SEQ ID NO: 6056 
SEQ ID NO: 6064 
SEQ ID NO: 6060 
SEQ ID NO: 6068 
SEQ ID NO: 6071 
SEQ ID N0:6051 
SEQ ID NO: 6075 



SEQ ID NO: 6056 
SEQ ID NO: 60 64 
SEQ ID NO: 6060 
SEQ ID NO: 6068 
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FITAFLLFLTIILQY 6 YATRSKVIYTI 1 KMIVI 1 WCFWPI 1 NIAVGVIS--CTYPPN-TGGLV 85 
:: :**: : 



FSFFMVSTLVMWVMYFANSFRLFRRARTFWaWNPBVNAITVTTVL-GQTYYQPIQQAPT 137 
FSILMACITDIUmJYFVNSIRLWRRTHSWWSFNPEPDMiLTTSVM-GRQVCIPVLGAPT 142 
FSIAGAIVTFVIOTIYFVRSIQLmTKSWWSFNPBTKMLCVSAL-GRSYVIiPLBGVPT 179 
FSIVFTIVAIIMWIVYFVNSIRLFIRTGSWWSFNPBTNNIilCIiaiK-GRMYVRPIIEDYH 143 
FSIVFTIVSIVIWIMyFVNSIRLFIRTGSWWSFNPErNNIJlCII»!K-GTVYVRPIIEDYH 149 
lEIMI&CIVGIiHNLSyE^SUSFIUiFERTRSMSFNFBTNIIiIiNVPIiR-GTIVTRPUlESEL 137 
AAIILTVFACLSFVGYWIQSIRLFKRCRSWWSFNPESNAVGSILLTNGQQCNFAIESVPM 145 

• • • ^ -k , , 1c±i( 4 k • 



GITVTLLS 6 VLYVDGHRLASGVQVHNLPEYMr\/AVPSTTIIYSRVGRSVNSQNSTG--WV 195 
GmTLLSGTLLVBGYIWATGVQVSQLPNFVTVAKATTTIVYGRVGRSVNASSGTG--WA 200 
GmTLLS(MiYAEGFmGa(DramPKYVMVALPSRTIVYTLVGKKLKASSATG--WA 237 
TLTVTIIRGHLYMQGrKLGTGYSLSDLPAYVTVAKVSHIiLTYKRG-FIiDKIGDTSG— FA 200 
TLTATIIRGHLYMQGVnX3TGFSLSDIjPAYVTVAKVSHIiCTYKRA-PIiDKVDGVSG--FA 206 
VIGAVIIR6HIiBBAGHSLGR-CDIKDIiPKEITVATS-BXIiSYYKLGASQR'\n?rDSG--FA 193 
VKPIIKNGVLYCEGQWLAK-CEPDHLPKDIFVCTPDRRNIYRMVQKYTGDQSGNKKRFA 204 

• 4 * * * 4 ** . * * , . 

4444»4 444 

FYVRVKHGDFSAVSSPMSNMTENERLLHFF 225 
FYVRSKHGDYSAVSNPSAVLTDSEKVLHLV 230 
YYVKSKAGDYSTEAR-TDNLSEQEKLLHMV 266 
VYVKSKVCaiYRLPSTQKGSGLDTALIiRNNI 230 
VYVKSKVGNYRLPSN-KPSGADTALLR— I 233 
AYNRYia6NYKU9TDEA6SNDNIAIiLVQ-- 221 
TFVYAKQSVDTGELESVATGGSSLYT 230 



FIGURE 7D 



MATVKWADASE PQRGRQG 18 

MASVSF QDRGRK 17 

MANQGQRVSWGDEST KTRGRSNSRGRKN 31 

MSFTPGKQSSS-RASSGNRSGNGILK WADQSDQSRHVQTRGRR-AQPKQTATSQQPS 55 

MSFVPGQENAGGRSSSVNRAGNGILKKTTWADQTERGPNNQNRGRR-NQPKQTATTQ-PN 58 

HSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQGLPN 48 

MASGKAAGKTDAPAPVIKIiGGPKP PKVGSS 35 



RIPYSLYSPLLVDSEQPW-KVIPRNLVPINKK-DKNKLIGYWN— VQKRFRTRKGK 70 

RVPLSLYAPLRVTNDKPLSKVLANNAVPTNKG-NKDQQIGYWN— ^IRWRMRRGE 70 

^NNIPLSFFNPITLQQGSKFWNLCPRDFVPKGIG-NRDQQIGYWN— RQTRYRMVKGQ 85 

GGNWPYYSWFSGITQFQKGKEFEFABGQGVPIAPGVPATEAKGYWYRHNRRSFKTADGN 115 
SGSWPHYSWFSGITQFQKGKEFQFABGQ 6 VPIANGIPASEQKGYWYRHNRRSFKTPDGQ 118 

NT ASWFTALTQHGK-BELHFPRGQGVPIHTffiSGPDDQIGYYRHATRR-VRGGDGK 101 

GNASWFQAIKAKKIOTPPPKFBGSGVPDNENIKPSQQHGYWR— RQARFKPGKGG 88 

* • • ** **. 4 4 * 

• • • ♦ * • • 



RVDLSPKLHFYYLGTGPHKDAKFRERVEGWWVAVDGAKTBPTGYGVRRKNSEPEIPHFN 130 
RIEQPSNWHFYYLGTGPHGDLRYRTRTE 6 VFWVAKBGAKTEPTNLGVRKASBKPIIPKFS 130 
RmPERWFFYYLGTGPHADAKFKDKLDGVVWVAKDGAMNKPTTLGSRGANNESKALKFD 145 
QRQLLPRWYFYYLGTGPHAKDQYGTDID 6 VYWVASNQADVNTPADII 1 DRDPSSD— EAIP 173 
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SEQ ID NO: 6071 QKQLLPRWYFyYL6TGPHAGASyGDSIEGVFWVANSQAD®ITRSDIVKRDPSSH--EAIP 176 
SEQ ID BO: 6051 IQaBL£n?iam7TIiGTOPE&SIiFy6&NKE6IVWVATEGAUf^ 159 
SEQ ID NO: 6075 RKPVPDAWYFYYTGTGPAADDJWGDTQDGIVWVAAKGADTKSRSNQGTRDPDKF--DQYP 146 

_*** **** . *** * _ 



SEQ 


ID 


NO 


6056 


SEQ 


ID 


NO 


6064 


SEQ 


ID 


NO 


6060 


SEQ 


ID 


NO 


6068 


SEQ 


ID 


NO 


6071 


SEQ 


ID 


BO 


6051 


SEQ 


ID 


NO 


6075 



QKLPN5VTWEE PDSRltfSRSQSR SQSRGRGESK 164 

QQLPSWEIVEPNTPPASRANSRSRSRGatRaJNRSRSPSNNRGNNQSRGNSQNRGNNQGRG 190 

GKVPGBFQLEVN QSRDNSRSRSQ SRSRSRNR 176 

TRPPPGTVLPQGYYIEGS-GRSAENSR STSRASSRASS 210 

TRFAPGTVLPQGFYVEGS-GRSAPASR SGSRSQSRGP 212 

LQLPQGTTLPK6EYAEGSRGGSQ&SSR SSSRSR6RSR 196 

LRFSDGGPDGNFRWDFIPLNRGRSGRS TAASSAAASR 183 



SEQ 


ID 


NO: 6056 


SEQ 


ID 


NO: 6064 


SEQ 


ID 


N0:6060 


SEQ 


ID 


NO: 6068 


SEQ 


ID 


NO: 6071 


SEQ 


ID 


BO: 6051 


SEQ 


ID 


NO: 6075 



PQSRNPSSDRNHN SQDDIMKAVAAALKSLGPDKPQEKDKKS 205 

ASQNRGGNNNNNNKSRNQSNNRNQSNDRG6VTSRDDLVAAVKDALKSLGI6ENPDRHKQ- 249 

SQSRGRQQFNNKK DDSVEQAVLAALKKLGVDTEKQQQRS- 215 

AGSRSRANSGNRT PTSGVTPDMADQIASLVLAKLGKDAAKP 251 

NNRARSSSNQRQ PASTVKPDMAEEIAALVLAKLGKDAGQP 252 

BSTPGSSRCfflS: PABMASGGGKCALAT.TJiTiDRmOIgSKy 235 

APSREGSRGRR SDSGDDLIARAAKIIQDQ 212 



SEQ ID NO: 6056 
SEQ ID NO: 6064 
SEQ ID NO: 6060 
SEQ ID NO: 6068 
SEQ ID NO: 6071 
SEQ ID NO: 6051 
SEQ ID NO: 6075 



AKTGTPKPSRNQSPASSQTSAKSIARSQSSETKEQKHEMQKPRISKRQPNDDVTSNVTQCF 265 

QQKPKQEKSDNSGKNTPK — KNKSRATSKERDLKDIPEWRRIPKG— ENSVAACF 300 

— RSKSKERSNSKTRDTTPK NENKHTWKRTAGK GDVTRFY 253 

QQVTKQTAKEIRQKILN KPRQKRSPNK— QCTVQQCF 286 

KQVTKQSAI^niQKILN KPRQKRTPNK— QCPVQQCF 287 

SCTSQQQQGQTVTKKSlAEaSK- KPRQKRTATK— Q3nSVTQAF 275 

QKKGSRITKAKADEMAH RRYCKRTIPP— NYRVDQVF 247 

: ' : : * * : 



SEQ ID NO: 6056 GPRDLDH NFGSAGWANGVKAKGYPQFAELVPSTAAMLFDSHIVSKESG 314 

SEQ ID NO: 6064 GPRGGFK NFGDAEFVEKGVDASGYAQIASLAENVAALLFGGNVAVRELA 349 

SEQ ID NO: 6060 GARSSSA NFGrTOLVJUJGSSAKHYPQLAECVPSVSSILFGSYWTSKEDG 302 

SEQ ID NO: 6068 GKRGPMQ NFGGGEMLKLGTSDPQFPIIAEIJ^AGAFFFGSRIiEIAJWQNLSGNLIffi 343 

SEQ ID NO: 6071 GKRGPNQ NFGGSEMLKLGTSDPQFPIIAELAPTVGAFFFGSKLELVKKN--SGGADE 342 

SEQ ID NO:6051 GIUtGPEQTQGNFGDQDLIRQGTDlKENPQIAQFAPSASAFFGIISRIGtlEVTP 327 

SEQ ID NO: 6075 GPRTKGK-BGNFGDDKMNEEGIKDGRVTAMLNLVPSSHACLFGSRVTPKLQL 298 

-k k kkk k . k » • 



SEQ ID NO: 6056 
S^ ID NO: 6064 
SEQ ID NO: 6060 
SEQ ID NO: 6068 
SEQ ID NO: 6071 
SEQ ID N0:6051 
SEQ ID NO: 6075 



HTWLTFTTRVTVPKDHP HLGKFLBBLNAFTR EMQ 349 

DSYEITYNYKMTVPKSDP NVELLVSQVDAFKTGNAK-LQRKKEIQOJKRETTLQ 401 

DQIEVTFTHKYHLPKDDP KTGQFLQQINAYAR PSEVAKEQR 343 

PQKDVYELRYNGAIRFDSTLSGFETIMKVIJIENLNAYQQQD GTMNMSPKPQRQ— R 397 

PTKDVYELQY^VRFDSTIJGFEriMKVIJiENIiNAYQK^ GADWSPKPQRKGRR 398 

SGTWLTYEGAIKLDnKDPQFRDNVIIiLBKHIDAYKTFP PTE 368 

DGLHIJlFEFTTVVPa)DPQFI»m«CICI)QCVD6VGTRPKDDEPKPKSRSSSRPATRG 355 



SEQ ID NO: 6056 QHPLLNPSALBPNPSQTSP ATAEPVRDBVSIETDIIDEVN 389 

SEQ ID NO: 6064 ‘ QHBEAIYDDVGAPSDVTHANLEVTOTAVDGairAVEIINBIFDT^ 446 

SEQ ID NO: 6060 KRKSRSKSABRSEQDWPDALIQJYTDVPDDTQVBIIDEVTN 385 

SEQ ID NO: 6068 GQKNGQGENDNISVAAPKSRVQQNKIRELTAEDISIiIiKKMDBP- FTEDTSBI 448 

SEQ ID NO: 6071 QAQBKKDEVDNVSVAKPKSSVQRNVSRELTPEDRSLLAQILDDGWPDGLEDDSNV 454 
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SEQ ID 110:6051 -PKKDKKKranJBAQPLPQRQKKQPTVTLLPAA 399 



SEQ ID NO: 607 5 NSPAPRQQRPKKEmiKKQDDEADKALTSDEERNNAQLEFYDEPKVINWGnAAIjGENEL 414 



FIGURE 7E 






wo 2004/092360 



70/193 



PCT/DS2004/011710 



FIGURE 8 



( 1)1 



.10 



avian IBV partial 5'UTR 161- (1) 

HCOV-OC43 5’UTR (1) 
bovine CV 5'UTR (1) 
Consensus (1) 



-20. 



.30 



. Section 1 
42 



TATTAAAATCTTATTGTTGCTGGTATCACTGCTTGTTTTGCC 



avian IBV partial 5'UTR 161- 
HCOV-OC43 5TITR 
bovine CV 5'UTR 
Consensus 



(43) 

(43) 

( 1 ) 

( 1 ) 

(43) 



43 






eo 



70 



> Section 2 
84 



GTGTCTCACTTTATACATCTGTTGCTTGGGCTACCTAGTGTC 



(85) 

avian IBV partial 5’UTR 161- (85) 
HCOV-OC43 5’UTR <1) 
bovine CV 5’UTR (1) 
Consensus 



Section 3 

86 Itg JOO J10 126 

CAGCGTCCTACGGGCGTCGrGGCTGGTTCGAGTGCGAGGAAC 



(127) 

avian IBV partial 5UTR 161- (127) 
HCOV-OC43 5’UTR (1) 
bovine CV5TJTR (1) 
Consensus (127) 




169 



(169) 

avian IBV partial 5'UTR 161- (169) 
HCOV-OC43 5'UTR (33) 
bovine CV 5’UTR (33) 

Consensus (169) ttca 



GATTG GAGCGATTTGCGTGCGTGCA TCCCGC 

- ■ - SecUon 5 

180 .190 200 210 




CTGATCTCTTGTTAGATCTTTTCGTAATCTA 

Section 6 



( 211 ) 

avian IBV partial 5'UTR 161- ^09) 
• HCOV-OC43 5TJTR (68) 
bovine CV 5‘UTR (68) 
Consensus ^11) 



211 



220 



230 . 



24CL 



252 



AACTTTATAAAAACATCCACTCCCTGTAG'TCTATGCCTGTGG 

Section 7 




(253) 

avian IBV partial 5'UTR 161- (251) : 

HCoV-OC43 5'UTR(110)| 
bovine CV 5'UTR (110) 

Consensus (253) gcgtagatttttcatagtggtgtctatatt catttct gct 



avian IBV partial 5'UTR 161- (293) P;( 



(295) 295 .300 



.310.... 

SccTi 



320 



-Section 8 
336 



Consensus (295) gttaacagctttcagccagggacgtgttgtatcctaggc 



(337) 337 350 .360 373 

SEQIDNO:9910 • (335) TTT^GflAjG^AGS^AT^C^ATAG^iS^jtlACAACAT^ 

SEQIDNO:9919 :(189) 

SEQ ID NO: 9892 : (189) RG!Td-j^e.CA)6'd;fc^ 

consensus (33^ AGTG GCCCACCCA'TAGGTC^AATG 



- Section 9 
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FIGURE 9 



F1: AT-^G^G G^GTG 

TGC A CX5 TTT C 


(136-154 nt) 


SEQ ID NO: 
6021 


F2: GTG^TG^AT^C^TTCA 
C C CC G 


(152-172 nt) 


6022 


F3: CTTCACfG JtCT^GT^A 


(168-195nt) 


6023 


R1: AGAA-^Q^ CAC *h’C-Ae§GG^TG 
G TACAA G CCT C 


(307-329nt) 


6024 


R2: AAAf G^TATAf^C^ ^TATG 


(265-288nt) 


6025 


«3= Cfc ^TATG ^^Afc ^^^GCCCA 


(250-274nt) 


6026 
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FIGURE 10 



Section 1 

( 1)1 .10 22 ^ 

avian IBV 3’UTR (NC_001451) 27103- (1) GT21ACATAATGGACCTGTTGTTTCCTGGTACATTTT 

HCoV-OC^3*UTR partial (1) 

t)ovineCV3'UTR (1) 

Consensus (1) 

SecHon 2 



avian IBV 3*UTR (NC_001451) 27103- 
HCOV-OC43 3*UTR partial 
bovine CV 3'UTR 
Consensus 



(37) 3L 









72 



(37) GTTAAACACTATTTCTGTGCTTTCCTATCAATTATT 

( 1 ) 

( 1 ) 

(37) 

Section 3 



(73) 73 .80 ^90 108 

avian IBV 3'UTR (NCJQ01451) 27103- (73) ACAGGCATTGATTGTGATTATGTTCAATACTTAAGC 

HCOV-OC43 3*UTR partial (1) 

bovine CV3'UTR (1) : 

Consensus (73) 

^ Section 4 

(109) ip9_ ^30 _ W 

avian IBV 3’UTR (NC_001451) 27103- (109) TTCT’r’rTGGTTGCTTTTTGCTTA'TTGTATfGTTGCT 

HCOV-OC43 3’UTR partial (1) 

bovine CV 3’UTR (1) 

Consensus (109) 



(145) 145 . J50 160 J7Q . 180 

avian IBV 3'UTR (NC_001<^1) 27103- (145) GTGCTTTTTATTGTTGTGATTCTcKiTT^rT^M--]§ 

HCOV-OC43 3’UTR partial (1) JSAii 

bovine CV 3’UTR (1) - IsM li^AA^ 

Consensus (145) A gagaatgaac 

^ Section 6 



(181) 181 J90 220 216 

avian IBV 3’UTR (NC_001451) 27103- (179) TgJ^C^A@A/^TCAA%^^iGAGTlAA^GSA^ 
HCoV-OC43 yUTR partial (14) 
bovine CV 3'UTR (11) 

Consensus (181) 'CTT AT GTCGGCACCTGG'i’GGTAACCCCTC GCAGG 

— - - — . Section 7 

(217)217_ 230 g40 ^ ^52 

avian IBV 3'UTR (NCJ001451) 27103- (215) ^|i%©AT^AG~CTT6?r‘pT|b^^ 

HCOV-OC43 3’UTR partial (48) 

bovine CV 3’UTR (45) 

Consensus (217) AAAGTCGGG ATAAGGCAC TCTCTATCAG 
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FIGURE 10 (contd.) 



Section 8 




(325) 325 



Section 10 
360 




avian IBV 3’UTR (NCJD01451) 27103- P23) | 

HCOV-OC43 SmR partial (139) 

bovine CV 3’UTR (136) i 

Consensus (325) AAAGTTTTGTGTGGTAATGTATAGTGTTGGAGAAAG 

— Section 11 

(361) 361 ^ 396 

avian IBV 3’UTR (NC_001451) 27103- (358) Sat^^TGCC AG^ iCjlGGiccic -^g| 
HCoV<)C43 3’UTRpartial(175)^-^to----g|^tei5f^^ ’ 



bovine CV3'UTR (172) 

Consensus (361) tg AAAGACT 




(397) 397_ 



TGCGGAAGTAATTGCCGACAAG 

Section 12 

410 420 432 



avian IBV 3‘UTR (NC_001451) 27103- (393) GAipp^A^CAiTl^^^CCCAl^^GGi 
HCOV-OC43 3’UTR partial (208) 



bovine CV3'UTR^03) 1 

Consensi^ (397) TGCCCAAGGGGAAGAGCCAGCACG TT AAGTTA 

— — Section 1 3 

(433) 4M ,440 ,450 468 



avian IBV 3‘UTR (NC_001451) 27103- (429) A 




Consensus (433) CCA CCAGTAATTAGTAAATGAAT6AAGTTAATTAT 

Section 14 

(469) ,480 ,490 504 

ran IBV 3’UTR (NC_001451) 27103- (462) feG^T^^pAT^T’r^ASTTTATAGGCTAGTATAGA 

Hr!n\/-OrMA VI ITR nzkrtiol f07A\ 

fc 



avian 



(505) 505 5J3 

avian IBV 3*UTR (NC_001451) 27103- (498) GTT AGAGCA ’ SEQ ID NO : 9911 

HCOV-OC43 3’UTR partial (293) SEQ ID NO: 9920 

bovine CV 3’UTR (290) SEQ ID NO: 9893 

Consensus (505) 
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FIGURE 11 

SEQ ID NO: 



F-1 TCTATC-f^AlGGATGrrCT 
AGA T 


(245 ~ 265 nt) 


6027 


F-2 TTAGTT J AA— TTT^GT^Tx GT 

G AG T G G 


(318- 339 nt) 


6028 


M TAGTGTT^GAGyAtGT^TAAAGA 

G A A G 


(346 ~ 368nt) 


6029 


R-1 A^TT^GCCATA^T^ AACTT 
C A T A 


(458 - 476 nt) 


6030 


R-2 ACTAA^^T-G T— C -TAA 

AATT A T CT CC C 


(426 - 448 nt) 


6031 






6032 


R-3 T ^TC-GC-^— C^-GCA 
AC c G GG CC G 


( 375 - 395 nt) 


6033 


FIGURE 12 






Coils 


out|>ut tor unknown 
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76/m 

FIGURE 




FIGURE 



Catalytic Cys 



SEQ ID NO: 6569 



1 sglrto^fps gkvegcmvqv e^^ttlngl 
61 ksnhsflvqa giinrqlrvigh 
121 spagvyqcam rpnhtikg^fllng^^svgf 
181 f ygpfvdrqt aqaagtdtti^V^nvlaxarlya 
241 pltqdhtvdil gplsaqtgia vlMhSLaalke 
301 sgvtfq 




Key residue 



Catalytic His 




dml npnyedlllr 
qpgq tfavlacyng 
ptg vhagtdlegk 
ndf nivantJcynye 
ede ftpfdwrqc 
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FIGURE 16 
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196 30-4096 40^096 60^09660-7096 
amino acid identity 





SEQ ID NO: 6574 & [UCU(C^CU) 
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FIGURE 18 
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FIGURE 19 
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FIGURE 20 
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FIGURE 21 



FIGURE 21 A 



G1 cons 




FIGURE 21B 




AIBV 
G1 cons 
SARS 
G2 cons 



FIGURE 22 




LP 


coQed-con 


colled-coil 


coiled-coll 


membrane anchor 


( ir 






1 ^^ 




l?2cL£jrr^-’^- ^ 


1 1 


1 1 


1 1 


1 


1 


1 1 



1 24 88 122 143 181 269 287 331 351 405 aa 



wo 2004/092360 



83/193 



PCT/US2004/011710 



FIGURE 23 

LPRKSQPTSISCRSVL-TNFKICVAVARLHA-CTYAV-TIINFTWDKKRVTRPSSADCIi 
RFRPCCSRSSAYLGFVRV-PKGKMESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEE 
AliSEAKEHIiKNGTCGLVEIiEKGVLPQLEQPYVFIKRSDALSTNHGHKWELVAEMDGIQY 
GRSGITLGVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDY 
EQlSnWKriTCHGSGALRELTREriNGGAWRYVDNNFCGPDGYPIiDCIKDFIiARAGKSMCTI.SE 
QLDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPL 
NSKVKVIQPRVEKRKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTCDFL 
KATCEHCGTENLVIEGPTTCGYLPTNAWKMPCPACQDPEIGPEHSVADYHNHSNIETRL 
RKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDLLEILSRE 
RVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGNYKWKGKPV 
KGAWNTGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAAWILDGISEQS 
LRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKLRPIFEWIEAKLSA 
GVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDWNKALEMCIDQWIAG 
AKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEWFLEGDSHDTVLTSEEW 
LKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQYCALSPGLLATNNVFRLKG 
GAP IKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNEKC SVYTVESGTEWEFACWA 
EAWKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAE 
CEEEEIDBTCEHEYGTEDDYQGLPLEFGASAETVRVEEEEEEDWLDDTTEQSEIEPEPEP 
TPEEPVNQFTGYLKLTDNVA IKCVDIVKEAQSANPMVI VNAANIHLKHGGGVAGALNKAT 
NGAMQKESDDYIKLNGPLTVGGSCLLSGHNLAKKCLHWGPNLNAGEDIQLLKAAYENFN 
SQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAVNDKALYEQWMDYLDNLKPRVE 
APKQEEPPNTEDSKTEEKSWQKPVDVKPKIKACIDEWTTLEETKFLTNKLLLFADING 
KLYHDSQNMLRGEDMSFLEKDAPYMVGDVITSGDITCWIPSKKAGGTTEMLSRALKKVP 
VDEYITTZPGQGCAGYTLEEAKTALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAH 
AEETRKLMPICMDVRAIMATIQKKYKGIKIQEGIVDYGVRFFFYTSKEPVASIITKLNSL 
NEPLVTMPIGYWHGFNLEEAARCMRSLKAPAWSVSSPDAVTTYNGYLTSSSKTSEEHF 
VETVSLAGSYRDWSYSGQRTELGVEFLKRGDKIVYHTLESPVEFHLDGEVLSLDKLKSLL 
SLREVKTIKVFTTVDNTNLHTQLVDMSMTYGQQFGPTYLDGADWKIKPHVNHEGKTFFV 
LPSDDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVL 
LALQQLEVKFNAPALQEAYYRARAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHAN 
LESAKRVLNWCKHCGQKTTTLTGVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQ 
ESSFVMMSAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYK 
GPWDVFYKETSYTTTTKPVSYKLDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPL 
PNASFDNFKLTCSNTKFADDLNQMTGFTKPASRELSWFFPDLNGDWAIDYRHYSASFK 
KGAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNLA 
CESQQPTSEEVVENPTIQKEVIECDVKTTEWGNVILKPSDEGVKWQELGHEDLM7SAYV 
ENTSITIKKPNELSLALGLKTIATHGIAAINSVPWSKILAYVKPFLGQAAITTSNCAKEtL 
AQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDAGINYVKS 
PKFSKLFTI AMWLLLLS ICLGSLICWAAFGVLLSNFGAPS YCNGVRELYLNS SNWTMD 
FCEGSFPCSICLSGLDSLDSYPALETIQWISSYKLDLTILGLAAEWVLAYMLFTKFFYL 
LGLSAIMQVFFGYFASHFISNSWLMWFIISIVQMAPVSAMVRMYIFFASFYYIWKSYVHI 
MDGCTSSTCMMCYKRNRATRVECTTIVNGMKRSFYVYANGGRGFCKTHNWNCLNCDTFCT 
GSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKNGALHLYFDKAGQKTYERHPLSH 
FVNLDNLRANNTKGSLPINVIVFDGKSKCDESASKSASVYYSQLMCQPILLLDQALVSDV 
GDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSELAKGVALDGVLSTFVSAARQG 
WDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFMLTYNKVENMTPRDLGAC IDCNARHIN 
AQVAKSHNVSLIWNVKDYMSLSEQLRKQIRSAAKKNNIPFRLTCATTRQWNVITTKISL 
KGGKIVSTCFKLMLKATLLCVLAALVCYIVMPVHTLSIHDGYTNEIIGYKAIQDGVTRDI 
ISTDDCFANKHAGFDAWFSQRGGSYKNDKSCPWAAIITREIGFIVPGLPGTVLRAINGD 
FLHFLPRVFSAVGNICYTPSKLIEYSDFATSACVLAAECTIFKDAMGKPVPYCYDTNLLE 
GSISYSELRPDTRYVLMDGSIIQFPNTYLEGSVRWTTFDAEYCRHGTCERSEVGICLST 
SGRWVLNNEHYRALSGVFCGVDAMNLIANIFTPLVQPVGALDVSASWAGGIIAILWCA 
AYYFMKFKRVFGEYNHWAANALLFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDV 
SFLAHLQWFAMFSPIVPFWITAI YVFC I SLKHCHWFFNNYLRKRVMFNGWFSTFEEAAL 




wo 2004/092360 PCT/US2004/011710 

84/193 

CTFLIiITKEMYIiKLRSETIiLPIiTQYMRYIiAIiYNKYKYFSGALT)TTR'VRB A arr HT .ft TraT .>m 
PSNSGADVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGIiWIjDDTVY 
CPRHVICTAEDMrJKfPITfEDLLIRKSNHSFLVQAGNVQriRVIGHSMQNCLriRLKVDTSNPK 
TPKYKFVRIQPGQTFSVTjACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCV 
SFCYMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITIiNVLiAWIiYAAVINGDR 
WFLNRFTTTLNDFNLiVAMKYNYEPLTQDHVDIIjGPIiSAQTGIAVLDMCAAIjKELLQNGMN 
GRTILGSTIIjEDEFTPFDVVRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSIiLILVQSTQW 
SLFFFVYENAFIiPFTLGIMAIAACAMLIiVKHKHAFLCLFLLPSLATVAYFNMVYMPASWV 
MRIMTWLELADTSLSGYRLKDCVMYASALVLLI LMTART\rYDDAARRVWTIjMNVITLVYK 
VYYGNALDQAI SMWALVI SVTSNYSGWTTIMFLARAIVFVCVEYYPLIiFITGNTLQC IM 
LVYCFLGYCCCCYFGLFCLLiNRYFRLTIjGVYDYIjVSTQEFRYMNSQGLIjPPKSSIDAFKL 
NIKLLGIGGKPCIKVATVQSKMSDVKCTSWIiLSVLQQIiRVESSSKLWAQCVQriHNDIIiL 
AKDTTEAFEKMVSLIiSVLLSMQGAVDINRLCEEMLDNRATIjQAIASEFSSLPSYAAYATA 
QEAYEQAVANGDSEWLKKLKKSIiNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSED 
KEIAKVTSAMQTMLFTMLRKLDNDAIjNNI INNARDGCVPLNI IPLTTAAKLMVWPDYGTY 
KNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSPNLAWPIjIVTAIiRANSAVKLQ 
NNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVIjALLSDHQDIiKWARFPKSD 
GTGTIYTEIiEPPCRFVTDTPKGPKVKYLYFIKGLNNIjNRGMyiiGSIiAATVRLQAGNATEV 
PANSTVLSFCAFAVDPAKAYKDYIiASGGQPITNCVKMIjCTHTGTGQAITVTPEANMDQES 
FGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLRNTVCTVCGMWKGYG 
CSCDQLREPmQSADASTFLNGFAV-VQPVLHRAAQAIiVIjMSSTGLLIFTTKKIiljVLQSS 
-KLIAVASRRRMRKAIY-TLTL-LRGILCIiTTNMKRLFITWLKIVQRLLSMTFSSIiE-MV 
TWYHIYHVSV-LNTQVJLI-SMLYVTIJplVIVIH-KKYSSHTIAVMMIISIRRIGMTS-RI 
LTSYAYMLT-VSVYANHY-RIjYNSAMLCVMQAIj-AY-H- I IRILMGTGTISVISYK-HQA 
AEFLLWIHITHC-CPSSIj-IjGHWIiLSPIWMIjISQNHLLSGIC-NMIIjRKRDFVSSTVIIjN 
IGTRHTIPIVIiTVWMIGVSFIVQTLMCYFLIiCFHIiQVLDH — EKYL-MVFLLLFQLDTIF 

vs-esyiirm-tyiarvsvsrnf-cmlliqlcmqllaiyc-inalhafq-lh-qtmllfk 

LSNPVILIKTFMTLLCLKVSLRKEVLLN-NTSSIiLRMATLLSVIMTIIVIICQQCVISDN 
SYS-LKLIiINTLIVTMVAVLMPTK-SLTIWINQIiVSHIjINGVRIjDFIMTQ-VMRIKMHFS 
RIIiSVMSSLL-IiK-ILSMPLVQRIELAP-LVSLSWL-QIDSFIRNY-SQ-PPLEELLW- 
LEQASFTVAGIIC-KLFTVM-KLHTrjWVGIIQNVTEPCLTCLG-WPIiLFLriANITIiAVTY 
HTVSTG-LTSVRKY-VRWSCVAAHYMIjNQVEHHPVMLQLIjMLIVSLTFVKIiLiQPM-MHFF 
QLMVIR-LTSMSAIYNTGSMSVSIEIGMLIMNSWMSFTLTCVNISP — FFIiMMPLCAITV 
TMRLKV — LALRTLRQFFIIKIMCSCLRQNVGLRLTLLKDLTNFAHSIQC-IjNKEMITCT 
CLTQIHQEY-AQAVLSMILSKQMVHIi-riKGSCHWIjIjMLTHLQNILIRSMIjMSPTCIYNTIj 
ESYMMSLLATCWTCIP-C-LMITPHGTGNLSFMRLCTHHIQSCRIj-VLVYCAIHRLHFVA 
VPVLGDHSYVASAAMTMSF^^^HTN-CCLLIPMFAMPQVVMSlJM-HNCI -EV-AI lASHIS 
LPIiVFHYVLMVRFLVYTKTHV-AVTMSLTSMR-QHVIGLMLAITYIjPTLVIjRDSSFSQQK 
RSKPLRKHLSCHMVrjPLYAKYSLTENCIFHGRLENLDHH-TETMSIiLVTV-LKIVKYRIiE 
STPLKKVTMVMIjLCTEVTiRHTS-MLVITLC-HLTIi-CHLVHLL-CHKSTM-ELIiACTQHS 
TSQMSFLAMLQIIKRSACKSTLHSKDHLVLVRVIIiPSDLIiSITHIjLA-CIRHAIjMQriliMP 
YVKRH-NICP- INWESYLRVRA-SVLINSK- IQH-NSMFSAL-MHCQKQLLTL-SLMKS 
LWLLIMT-VLSMLDFVQNTTSILAILLNYQPPAHC-LKAH-NQNILIQCADL-KQ-VQTC 
SLELVAWLLKLLTL-VIi-FMTIS-KHTRISQLNASKCSTKVLLHMMFHLQSTDLK-AL- 
ENFLHAILLGEKLFLSHLIIHRTL-LQKS-DCLRRLLIHHRVLNMTMSYSHKLLKQHTLV 
MSTASMWLSQGQKIx/yFCA-CLIEIFtmiClSIIiQV-KYHVAMWLHYKQKM-rjDFriRTVVRSL 
LVFILHRHLHTSALI-SSRLKDYVLTYQAYQRT-PTVDSSL-WVSK-ITKSMVTLICLSP 
AKKLFVTFVRGIiAIjM-RAVMQIjEMLWVIjTYXjSS-DFLQVIjT — LYRI lVMLTLKITQNSPE 
LMQNIiHQVTSLNILYHSCIKACPGM-CVLR-YKCSVIH-KDCQTESCSSFGRMALSLHQ- 
STLSRIiDLKERWCVTNVQIAFIiLHQILMPAGI ILWVL.TMS ITHL-LMFSSGALiRVTFRV 
TMTNIARYMEMHMWLVVMLS-IiDV-QSMSALIjSALIGIjIjNTLLi-EMN-GLILIjAEKYNTW 
L-SLHCLLISFQFFMTLEIQRIjSSVCIiKLK-NGSSTMLSHWTKLTK-RNSSIIjMLHITI 
NSIjMVFVCFGIVTLIVTQPMQLCVGLTQESCQT-TYQAVMVWCM- I SMHSTLQLS IKVH 
LLI - SNCLSFTILrVLVSLMANK-CRILIMFHSNLLRVTiHDAI - WLFADTMQMSTDSTW 
lOIII — FLLDIiAYGFTNNLILITCGIHLPGYRV-KMWLIMLLIKDTLMDTPAKHIiFPSLI 
MLFTQR-MVIiMWRSLKIRQHFLLMLHLSFGLSVTLNQCQRLRYSIIWVLISLLIL-SGTT 
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KEKPQHMYLQ-VSAQ-LTLPRNIiLRVIiVLHIiLSCrjMVEWKDR-TFIjETPVMVF — QKVQS 
KV-HLQRDQHKLASMESH-LENQ-KHSLTTIjRK-TAIjFNSCLKPTIjIiRAET-RILSPDHK 

WKLTFSSSLWMNSYSDISSRAMPSNTSFMEISVMDNIiAVFI A-PSAHKIHHLN-RIL 

SLWTAQ-KITS-QMRKQVHQNVCVL-LIFYLMTLSR — SHKICQ-FQKWSRLQLTMLKFH 

SCFGVRMDMLKPSTQNYKQVERGNQVLRCLTCTRCKECFLKSVTFRIMVKMLLYQKE 

MSQSimCVOT-IHLL-LYPTT-ELFTLVIiAIiIKELHQVQLCSDNGCQriAHYLSIQirjMT 
SSPTHIIiL-LETVQQYIRLINGTLIiIiAICMTLGPNM-QKRMTI.KKGFSLICVDIi-SKN-P 
WWIj-D-R-QS ILGMLTFTSLWAI SHGGQIiLLQM-MHHHRKHF-LGLTIIiASRRNKIiMAI 
PCMLTTFSGGTQILSSCLPIHSIjT-ANFIjIjN-EEIiL-CLI.RRIKSMI-FILFWKKVGLSL 
EKTTEIjWFQVIFLLTTKRTCIiF S YYFIiliSLVWTLTGAPrjLMMFKLrj ITIiNILHL -GGFT 
irMKFLDQTLFI-LRI YFFHFIIiMLQGFILLI IRLATLSYLIiEMVFILLPQRNQMLSWG 
FLVIiP-TTSHSR-LLLTILIjMLIjYEHVTIiNCVTTLSLLFIiNPWVHRHIL-YSIMHLIALS 
STYLMPFRIjMFQKSQVILNTYESLCLKIKMGFSMFIRAINL-M-FVIYLIiVLTL-NLFLS 
CLLVLTLQILEPFLQPFHLLKTFGARQLQPILriAI - SQLHLCS SMMKMVQSQMLLIVIjKI 
HLLNSNALLRALRLTKEFTRPLISGLFPQEML-DSLILQTCVIiLERFLMIjIiNSHiSMHGR 
EKKFIilVIiljITIiCSTTQHFFQPIjSAMAFLPLS -MIFASPMSMQILL- SREMM-DK-RQDK 
LVLIiLI 1 1 INCQMI SWWSLLG ILGTLMLLQLVI 1 1 INIGI LDMASLGPIiRETYtiMCLSP 
LMANIiAPHTjLLIVIGH-MIMVFTPLliAIiATNLTEL-YFLrjNF-MHRPRFVDQNYPriTLLR 
TSVSIIjILMDSI.VLVC-LIjLQRr)FNHFNNIiAVMFLISIiIPFEILKHLKY-TFHLAIiI.GV- 
V-LHLEQMLjHIjKLIiFYIKMIjTAIiMFIjQQFMQINSHQLGAYILLETMYSRLKQAVIi-ELSM 
STLIMSATFLLELAFVLVTIQFIjYYVVLAKNLLV/IilljCIj-VIjIVQLLTLITPIiLYIjLTFQ 
liALIiQK-CI-FLWLKPP-IVICTSi^iriljNVLICFSNMVAFAHN-IVHSQVIiIjIiNRIATHV 
KCSLKSNKCTKPQL-NIIiVVIiIFHKYYLTL-SQLRGIjIjIjRTCSIjIR-HSIiMIiAS-SNMAN 
A-VIIJ>II*EI SFVRRSSMDLQCCHIiCSLMI -LIiPTIiLL-LVVI.PLLDGHLVriALIiFKYIjLL 
CKWHIGSMALELPKMFSimTKlSnKSPTNIjTRRLVKFKNHLQQHQIiHWASCKTIiLTRMLKH- 
THLUSINIALILVQFQVC-MISFRDLIKSRRRYKLTG-LQADFKAFKPM-HNN-SGLLKSG 
LLLILIiLLKCLSVFLDNQKELTFVERATTLCPSHKQPRMVLSSYMSRMCHPRRGTSPQRQ 
QFVMKAKHTSLVKVFLCLMAIiLGLLHRGTSFIiHK-IjIiQTIHIjSQEIVMSIiLASLTTQFMI 
• LCNLSli'mSKKSWTSTSKI IHHQMLILATFQALTIiliSSTFKKKIiTASMRSLKI -MNHSLT 
FKNWENMSNILNGLGMFGSASLIjD-LPSSWIjQSCFVA-LWAVASRVHAIiVVIiAASLMRM 
TLSQFSRVSNYITHKRTYGFVYEIFYSWINYCTASKN-QCFSCKYCSCYSNDTATSLTPF 
RMACYWRCISCCFSERYQNNCAQ-KMAASPL-GLPVHLQFTAAICYHLFTSFACRCRYGG 
AIFVPIiCL.DIFSTMHQRM-NYYEMIjALLEVQIQEPITL-CQIiIjCLIjAHT-L-IjIjYTI-QC 
HRYNCRY-R-RHFNTKTQRRLPNWWLF-G-ALRC-RLCRCTWIjFHRSLLPA-VYTNYYRH 
WY-KCYILHL-QAC-RPTECANTHNRRLFRSC-SSNGSNr. — ADDDY-RAFVSTRK-VRT 
YVLIRFGRNRYVNS — RTSFSCFRGILASHTSHPYCASIVCVIiLQYC-REFSKTNGLRLL 
AC-KSELF-RSS - SSGIxNELTI 1 1 ILFGTLTLLIMADNGTITVEELKQLIiEQWNIjVIGFL 
FIiAWIMLLQFAYSNRNRFLYIIKLVFLWLLWPVTLACFVIiAAVYRINWVTGGIAIAMACI 
VGLMWLSYFVASFRLFARTRSMWSFNPETNILLNVPLRGTIVTRPIjMESELVIGAVIIRG 
HLRMAGHSLGRCDIKDLPKEITVATSRTLSYYKLGASQRVGTDSGFAAYlSIRYRIGNYKIiN 
TDHAGSNDNIALLVQ-VTTDVSSC-LPGYNSRDIDYHYEDFQDCYIiES-RYNKFNSETII 
-ASN-EELFGVR — RTYGVRLSIKRT-KLFSS-H-IiYLHLASYITIRSVIjEVRLYY-KNL 
AHQEHTRAIHHFTLLLTINLH-LAIiAHTLLLLVIiTVLDIPISCVQDQFHQNFSSDKRRFN 
KSSTRHFFSLLLL-YF-YFASPIiRERQNE-AHFN-LLFVTiFSLSAIPCFNNAYYILVFTR 
NPGSRRTLYQSIiNEHETSHCFDIjYFSMQIjHMHCSTALCI — TSCA-RSL-GTTIjGVILIA 
LLGFVL-ERFYIiFIDGTLWFKHAHIjMLLSTVKIQIjVVRL-LGVGTFMKVTKLLHLETYI.Ij 
F-INEQIKMSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQGIiPNNTAS 
WFTAIiTQHGKEEriRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKMKELSPRWYFY 
YLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTKNPNNNAATVLQIjPQGTTLPKGF 
YAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGWSPARMASGGGETALALL.LLDRLNQLE 
SKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDIi 
IRQGTDYKHWPQIAQFAPSASAFFCTISRIGMEVTPSGTWLTYHGAIKLDDKDPQFKDNVI 
liLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPTVTDIiPAADMDDFSRQLQNSM 
SGASADSTQA-TLMMTTQGRWAM-TFSQFRLRYIVYSCAE-ILVTKQHK-V-LTLISHSN 
L-SMCNIREDLKEPPHFHRGHAEYDRGYSE-C-GELPIWKSPNV-N-F-CYPHVILIAS 
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FIGURE 24 

TQEKPTNLDLL-ICSLNEL-NLCSCRSAACLVHLRSINNNKFYCR-QETSNSSLFCRLLT 
VSSVLQS 1 1 SI PRFRPGVTER-DGEPCSWCQRENTRPTQFACPSG-RRASAWLRGLCGRG 
PIGGT-TPQKWHLWSSRAGKRRTAPA-TAIiCVH-TF-CLKHQSRPQGR-AGCRNGRHSVR 
S-RYNTGSTRATCGRNPNCIPQCSSS-ER — GSRWS-LWHRSKVL-LR-RAWH-SH-RI.- 
TKLEH-AWQWCTP-THS-AQWRCSHSLCRQQFLWPRWVPS-LHQRFSRTRGQVNVHSFRT 
T-IiHRVEERCLLLP-P-A-NCLVH-AL — ELRAPDTLRN-ECQEI-HFQRGMPKVCVSS- 
LKSQSHSTTC-KEKD-GFHGAYTLCVPCCISTGV-QYALVYLDEM-SLR-SFMADVRIiSE 
SHL-TIiWH-KFSY-RTYYMWVPTY-CCSENAMSCLSRPRDWT-A-CCRLSQPLKH-NSTP 
QGR-D-MFWRIiCVCLCWIiL — ACIiIjGSSC-C-YWIiRPYWHYW-QCGDL.E-GSP-DTES-T 
C-H-HCWRFSFE-RGCHHFGIFIiCFYKCLY-HYKES-IiQVFQNHC-VLR-L-SYQGKAElK 
RCLEHWTTEISFNTTVWFSLTGCWCYQINFCAHT-CSKPLNS-FAKSSCHHT-WYF-TVI 
TSCRRHGIiYFRPAHQQCHYYGICNWWSCTTDFSVW-SFGHYC-KTQAYIi-MD-GET-CR 
S - 1 SQGCLGDSQI SHYRCF-HRQGSNTGCFR-HQGLCKMIjH-CC-QGTKNVH- SSHYRWR 
KVAITQLR-SLHRSKQGTLPSVYTWQGAAATTHAS-GTKRSNIjS-R-FT-HSTYIi-GGCS 
QER-rTRSTRDAR— LHKWSYRRHTSLCKWPHALRD-GQRTILRIVSWFTGYKQCLSLKRG 
CTN-RCNLWRRYCLGSSRLQECENHI-A — TC-QSA — KVLCLHC^IRYRSY-VCMCCSR 
GCCEDFTTSF-SPYQHGY-S — VECSYILLI — CW-RKLFITYVLFLLPSR-GRRGRCRV 
-GRRN — NL-T-VRYRG-IiSRSPSGIWCLS-NSSS-GRRRGRLAG-YY-AIRD-ARTRTY 
T-RTS-SVYWLFKTY-QCCH-MC-HR-GGTKC-SYGDCKCC-HTPETWWWCSRCTQQGNQ 
WCHAKGE^ — liH-AKWPSYSRRVLFAFV'TT-SC-EVSACCWT-PKCR-GHPAS-GSI-KFQF 
TGHLTCT IWSRHIWC -TTSVFTSVRADGS YTGEYCSQ- QS SL- AGCHGLS — PEA- SGS 
T-TRGATKHRRFQN-GEICRTEACRCEAKN-GLH — GYHNTGRN-VSYQ-VTLVC-YQW- 
AliP-FSEHA-R-RYVFP-EGCTLHGR-CYH-W-YHIiCCNTLQKGWWHY-DALKSFEESAS 
— VYNHVPWTRMCWIjYT-GS-DCS-EMQICILCTTFRST-C-GRDSRNCILEFERNACSC 
-RDKKINAYMHGG-SHNGNHPT-V-RN-NSRGHR-LWCPILLLY — RACSFYYYEAELSK 
-AACHNANWLCDTWP- S-RGCALYAFS-SSCRSVSI ITRCC YYI-WI PHFVIKDI -GALC 
RNSFFGWLLQRIiVLFRTAYRVRC-IS-AW-QNCVPHSGEPRRVSS-R-GSFT-QTKESLI 
PAGG-DYKSVHNCGQH-SPHTACGYVYDIWTAVWSNUiGWC-CYKN-TSCKS-G-DFLCT 

T HTT — SFRVLPYS — EFSW-VHVCFKPHKEMEISSSWWFNFN-MG — QLLFV-CFI 

STSTA-SQIQCTSTSRGIiL-SPCW-CC-LLCTHTRLQ — NCWRAW-CQRNYDPSSTAC-P 
GICKASS -CGV-TLWSENYYIiNGCRSCDVYGYS IL — S-DRCFHSMCVWS -CYTI SSTTR 
VFFCYDVCTTC-V-ITARYIIjyiCE-VHW-IiSVWSLHSYNC-GDPIjSY-RSSPYKDVRVQR 
TSD-CFLQGNIIiHYNHQACW-TRWSYLHRD-TKIGWVL-KG-CLIiYRAAYRPCTNSTIT 
KCEF — FQTHMF-HKIC — FKSNDRLHKASFTRAICHIIiPRIiEWRCSGY-L-TLFSEFQE 
RC - ITA-ANCI^-PGYNQDNVQTKHLVFTIiSLEYKASRYFKFI - SSGSRRHTRNGQSCIi 
-KSTTHIj-RSSGKSYHTEGSHRV-RENYRSCRQCHT-TIR-RC-SNTRVRS-GSYGCLCG 
KHKHYH-ET — AFTSLRFKNNCHSWYCCN — CSLE-NFGLCQTILRTSSNYNIKIiR-EIS 
TTCV-QLYALCVYirVPIVYFY-KYQF-N-SFTTYNYC-K-C-ECC-IMFGCRH-LCEVT 
QIF-IVHNRYVAIWKYIiLRFSNLCNCCFWCTL.I-FWCSFLL-WR-RIVS-FV-RYYYGF 
L-RFFSLQHLFKWIRLP-FLSSS-NHSGDDFIVQARLDNFRSGR-VGFGIYWHKILLFI 
RSFSYNAGVLWLFC-SFHQQFIiAHWYH-YCTNGTRFCNG-DVHLLCFPLIjHMEEIiCSYH 
GWLHIiFDriHDVL-AQSCHTR-VYNYC-WHEEIFLCLCKWRPWIiLQDSQLELSQL-HIIiHW 

- YIH SCS -FVTPV-KTNQPY- PVIVYC — CCCEKWRASPLL- QGWSKDL-ETSALPF 

CQFRQFES-QH-RFTAY-CHSF-WQVQMRRVCF-VCFCVIjQSADVPTYSVA-PSSCIRRW 
R-Y-SFR-DV-CIiCRHLFSNF-CSYGKT-GTCCYSSQRVSKGCSFRWCPFYIRVSCPTRC 
C-YRC-HKGCY-MSQTFTSL-IjRSDR-QIj-QFHAHIi — G-KHDAQRSWRMY-I i-CKAYQC 
PSSKKSQCFTHLECKKLHVFI-TAA-TNS-CCQEEQHTF-TNLCYN-TGCQCHNY-NLTQ 
GW-DC-YLF-TYA-GHIIVRSCCIGLLYRYASTYIVNP-WLHK-NHWLQSHSGWCHS-HH 
FY — LFCK-TCWF-RMV-PAWWFIQK-QKLPCSSCYHYKRDWFHSAWLTGYCAESNQW-L 
LAFSTSCF-CCWQHLLHTFQTH-V — FCYLCIiRSCC-VYNF-GCYGQTCAILL-H-FARG 
FYFIi — ASSRHSLCAYGWFHHTVS-HLPGGFC-SSNNF-C-VL-TWYMRKVRSRYLPIYQ 

W-MGS ALQSSIRSFLWC-CDESHS-HLYSSCATCGCFRCVCFSSGWWYYCHIGDLCC 

LLLYEIQTCPW-VQPCCCC-CTFVFDVFHYTLSGTSIiQLSAGSLLSLLLVUDILFHQ-CF 

ILGSPSMVCHVFSYCAFIjDNSNIiCILYFSEALPIjVIj-QLS-EKSHV-WSYI-YIiRGGCFV 
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YLFAQQGNVPKIA-RDTVATYTV-QVSCSI-QVQVFQWSLRYYQLS-SSLLPIiSKGSK-L 
-QliRC-CSLPTTTDINHFCCSAEWF-ENGIPVRQS-RVHGTSNLWNYNS-WIWG-HSIL 
SKTCHLHSRRHA-S-L-RSAHSQIQP-LSCSGWQCSTSCYWPFYAKLSA-A-S-YF-P-D 
TQV-ICPYPTWSNIFSSSMLQWFTIWCLSVCHET-SYH-RFFP-WIMW-CWF-H-L-LRV 
FLIjYASYGASNRSTRWY-LRR-ILWSIC-QTNCTGCRYRHNHNIKCFGMAVCCCYQW — V 
vs — IHHYFE-L-PCGNEVQL.-TFDTRSC-HIGTSFCSNRNCRLRYVCCFERAAABWYEW 
SYYPW-HYFRR-VYTI-CC-TMLWCYLPR-VQENC-GHSSLDAFNFLDITIDSCSKYTW 
TVFLCLRECFLAIYSWYYGNCCMCYAAC-A-ARILVIiVSVTFSCNSCLL-YGLHAC-LGD 
AYHDMA-IG-H-LVWL-A-GLCYVCFSFSFAYSHDSSHCL — CC-TCLDTDECHYTCLQS 
liLW-CFRSSYFHVGDSYFCNL-LFWCRYDYHVFS-SYSVCVC-VLPIVIYYWQHIiTVYHA 
CLLFLRLLLLLLLWPFLFTQPLLQAYSWCL-IiIiGLYTRI-VYEIiPGAFAS-E-Y-CFQA- 
H-WGYWR-TMYQGCYCTV-NV-RKVHICGTALGSSTT-SRVIF-IVGTMCTTPQ-YSSC 
KRHN-SFREDGFSFVCFAIHAGCCRH — WRGajAR-PCYSSGYCFRI-FFTIICRLCHCP 
GGIj-AGCS-W-F- SRSQKVKEIFECG- I -V- P-CCHATQVGKDGRSGYDPNVQTGKI -GQ 

EGKSN-CYANNAIiHYA-EA CT-QHYQQCA-WLCSTQHHTIDYSSQTHGCCP-LWYLQ 

EHIi-W-HLYICICTLGNPASC-CG-QDCST — N-HGQFTKFGIiASYCYSSKSQLSC-TTE 
— TESSSTTTDVLCGWYHTNSLY — QCTCLL-QFEGR-VCAGITIRPPRSQMG-IP-E-W 
YRYNLHRTGTTL-VCYRHTKRA-SEILVLHQRIiKQPK-RYGAGQFSCYSTSSGWKCYRST 
CQFNCAFLLCFCSRPC -SI -GLPSIOflRTlTSrHQLCEDVVYTHWYRTGNYCNTRS -HGPRVL 
WWCFMLSVL-MPH-PSKS-RIL-LER-VRPlsITYHLC — PSGFYT-IOHSLYRI jKNVERLWI. 
-L-PTPRTIiDAVCGCINVFKRVCGVSAARLTPCGTGTSTDWYRAFDIYNEKVAGFAKFL. 
KTNCCRFQEKDEEGNIiliDSYFWKRHTMSNYQHEETIYNLVKDCPAVAVHDFFKFRVDGD 
MVPHISRQRIiTK5fTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENP 
DILRVYANLGERVRQSLLKTVQFCDAMRDAGIVGVIiTLDNQDLNGNWYDFGDFVQVAPGC 
GVPrVDSYYSliLMPILTI.TRALAAESHMDADLAKPLIKWDLIiKYDFTEERLCIiFDRYFKY. 
'WDQTYHPNCINCLDDRCILHCANFNVIiFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFR 
ELGWHNQDVNLHSSRLSFKEIiliVYAADPAMHAASGNLLLDKRTTCFSVAALTlSrNVAFQT 
VKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQIi. 
LFVVEVVI)KYFIX:YDGGCiNANQVIVNNIiDKSAGFPFNKWGKARIiYYDSMSYEr)QDAIiFA 
YTKElNVIPTITQMNIiKYAI SAKNRARTVAGVSTCSTMTNRQFHQKLLKS lAATRGATWI 
GTSKFYGGWHNMLKTVySDVETPHLMGWDYPKCDRAMPNMLRIMASIiVLARKHNTCGNIiS 
HRFYRIiANECAQVLSEMVMCGGSIiYVKPGGTSSGDATTAYANSVFNICQAVTANVNALIiS 
Trxa^IADKYVRNLQHRLYEGIiYRNRDVDHEFVDEFYAYLRKHFSMMIL.SDDAWCYNSN 
YAAQGIiVASIKNFKAVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYli 
PYPDPSRILGAGCFVDDIVKTDGTIiMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIR 
KLHDELTGHMLDMYSVMIiTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCG 
ACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKP 
PISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDVmiAGDYILANTCTERIiKIiFAAET 
LKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGE 
YTFEKGDYGDAWYRGTTTYKLNVGDYFVIiTSHTVMPLSAPTLVPQEHYVRITGLYPTIiN 
ISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDAL 
CEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIWFDEIS 
MATNYDLSWNARIiRAKHYVYIGDPAQLPAPRTLLTKGTIiEPEYFNSVCRIiMKTIGPDMF 
LGTCRRCPAEIVDTVSALVYDNKIjKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVR 
EFLiTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCN 
VNRFNVAITRAKIGILCIMSDRDLYDKLQFTSLEIPRKNVATLQAENVTGLFKDCSKIIT 
GLHPTQAPTHLSVDIKFKTEGLCVDIPGIPKDMTYRRL.ISMMGFKMNYQVNGYPNMFITR 
EEAIRHVRAWIGFDVEGCHATKDAVGTNLPLQLGFSTGVNIjVAVPTGYVDTENNTEFTRV 
NAKPPPGDQFKHLIPLMYKGLPWNWRIKIVQMLSDTLKGLSDRWFVLWAHGFEIiTSMK 
YFVKIGPERTCCLCDKRATCFSTSSDTYACWNHSVGFDYVYNPFMIDVQQWGFTGfJLQSN 
HDQHCQVHGNAHVASCDAIMTRCIiAVHECFVKRVDWSVEYPIIGDELRVNSACRKVQHMV 
VKSALLADKFPVliHDIGNPKAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDK 
FTIXSVCLFWNCNVDRYPANAIVCRFDTRVIiSNIiNLPGCDGGSLYVNKHAFHTPAFDKSAF 
TNIiKQLPFFYYSDSPCESHGKQWSDIDYVPLKSATCITRCNLGGAVCRHHANEYRQYLD 
AYNMMI SAGFSLWI YKQFDTYNIiWNTFTRIjQSIiENVAYNWNKGHFDGHAGEAPVS I INN 
AVYTKVDGIDVEIFENKTTLPVNVAFELWAKRNIKPVPEIKILNNLGVDIAANTVIWDYK 
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REAPAHVSTIGVCTMTDIAECKPTESACSSLTVIiFDGRVEGQVDLFRNARNGVIjITEGSVK 
GLTPSKGPAQASVNGTVTLIGESVKTQFNYFKECVDGIIQQLPETYFTQSRDLEDFKPRSQM 
ETDFLELAMDEFIQRYKIiEGYAFEHIVYGDFSHGQIiGGIiHIiMIGLAKRSQDSPIiKLEDFI 
PMDSTVKNYFITDAQTGSSKCVCSVIDLLIjDDFVEIIKSQDLSVTSKWKVTIDYAEISF 
MLWCKDGHVETFYPKLQASRAWQPGVAMPNIjYKMQRMLIjEKCDLQNYGENAVIPKGIMMN 
VAKYTQI.CQYLNTLTIiAVPYNMRVIHFGAGSDKGVAPGTAVIjRQWLPTGTriI.VDSDLNDF 
VSDAYSTLIGDClATVHTANKWDLiIISDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLAIi 
GGSIAVKITEHSWNADLYKLMGHFSWWTAFVTNVNASSSEAFIiIGANYLGKPKEQIDGYT 
MHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMIYSLLEKGRLIIR 
ENNRVWSSDILVNN-TNMFIFIiLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYY 
PDEIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFCTJPVIPFKDGIYFAATEKSNWRGWV 
FGSTMNNKSQSVI I INNSTNWIRACNFELCDNPFFAVSKPMGTQTHTMI FDNAFNCTFE 
YISDAFSLJDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDIiPSGFNTLKPIFKL 
PLGINITNFRAIIiTAFSPAQDIWGTSAAAYFVGYIjKPTTFMLKYDENGTITDAVDCSQNP 
LAEIjKCSVKSFEIDKGIYQTSNFRWPSGDWRFPNITNLCPFGEVFNATKFPSVYAWER 
KKISNCVADYSVIiYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFWKGDDVRQIAPGQT 
GVIADYNYKLPDDFMGCVIiAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDI SNVPFS P 
DGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVWIjSFElirjNAPATVCGPKLSTDIiIKN 
QCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPKTSEIIiDISPCAFGGVS 
VITPGTNASSEVAVIiYQDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHV 
DTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSIjGADSSIAYSNNTIAIP'rNFSI 
SITTEVMPVSMAKTSVDCNMYICGDSTECANIiLIjQYGSFCTQIjNRALSGIAAEQDRISITRE 
VFAQVKQMYKTPTIjKYFGGFNFSQILPDPLKPTKRSFIEDLDFNKVTLADAGFMKQYGEC 
liGDINARDIilCAQKFNGLTVLPPIiLTDDMIAAYTAAIiVSGTATAGWTFGAGAAIiQIPFAM 
QMAYi^TGIGWQKTTLYENQKQIANQFNKAISQIQESL'PI'TSTAUSPajQDVVNQNAQALN 
TLVKQLSSNFGAISSVIjNDILSRIiDKVEAEVQIDRLITGRIiQSLQTYVTQQLIRAAEIRA 
SANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGWFLHVTYVPSQERNFTTAPA 
ICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQI ITTDNTFVSGNCDWIGI INNTVYDP 
LQPELDSB’KEELDKYFKNHTSPDVDLGDISGINASWNIQKEIDRLNEVAKNIjNESLIDL 
OELGKYEQYIKWPWYVWLGFIAGLIAIVMVTILIiCCMTSCCSCIiKGACSCGSCCKFDEDD 
SEPVI.KGVKIiHYT-TNLWICL-DFI..IiIiDQLIiHSQ-KI.TMIiLIiQVLFMj:iQQRYRYKPHSLS 
DGLIiLALHFLLFFRALPK-LRSIKDGS-PFIRASSSFAIYCCYLLPSIHXFCIjSLQVWRR 
NFCTSMP-YIFYNASTHVELL-DVGFVGSANPRTHYFMMPTTIiFAGTHITMTTVYHITVS 
QIQLSLLKVTAFQHQNSKKTTKLWILRIGTQVLKTMSDYMAISPKFTTSLSLHKIiliQTL 
VLKMLHSSSLTSLLKTHRMCKYTQSTALQEIiLIQQWIQFMMSRRRLLACLCKHKKVSTNL 
CTHSFRKKQVR — LIAYFFFLLSWYSC-SH-PSLLRFDCVRTAAILLT-V — NQRFTSTR 
VLKI-TLLKEFLIFWSKRTNYYYYSVWNFNIAYHGRQRYYYR-GA-TTPGTMEPSNRFPI 
PSLDYVTTICLF-SEQVFVHNKACFPLALVASNTCLFCACCCLiQN-LGDWKDCDCNGLYC 
RLDVA-LLRCFLQAVCSYPLNWIQPRNKHSSQCASPGDNCDQTAHGK-TCHWCCDHSWS 
LANGRTLPRAL-H-GPAKElDHCGYITNAFIiLQIRSVAACRH-FRFCCIQPrjPYWKL-IKY 
RPRR-QRQYCFASTVSDNRCFILIiTSRLQ-QRY-LSIi-GIiSGLLFGILTIi — VQ — DNYL 
Sli -liRRI IRS -MMKNLWS - 1 IHKTNMKI ILFLTL IVFTSCELYHYQECVRGTTVLL.KEPC 
PSGTYEGNSPFHPIiADNKFALTCTSTHFAFACADGTRHTYQLRARSVSPKLFIRQEEVQQ 
ELYSPLFLIVAALVFLILCFTIKRKTE-MSSL-LTSICAF-PFCYSLF — CLLYFGFHSK 
SRI-KNLVPKSKRT-NFSIiF-LVFLYAVAYAL-YSAVHLINLMCIjKILVRYNTRGNTYST 
AWIiCALGKVTjPFHRWHTMVQTCTPNVTINCQDPAGGALIARCWYIjHEGHQTAAFRDVI.VV 
LNKRTN-NV — WTPIKPT-CPPHYIWWTHRFN-Q-PEWRTQWGKAKTAPTPRFTQ-YCVL 
VHSSHSAWQGGT- I PSRPGRSNQHQ-WSR- PNWLLPKS YPTS SWW-RQNERAQPQMVLLL 
PRNWPRSFTSLRR-QRRHRMGCN-GSLEYTQRPHWHPQS — QCCHRATTSSRNNIAKRLIi 
RRGKQRRQSSIiFSLIiIT-SR-FiaCFNSWQQ-GKFSCSNG-RRW-NCPRAIAARQIEPA-E 
QSFW-RPTTTRPNCH-EICC-GI-KASPKTYCHKTVQRHSSIWETWSRTNPRKFRGPRPN 
QTRN-LQTIAANCTICSKCLCILWNVTHWHGSHTFGNMADLSWSH-IG-QRSTIQRQRHT 
AEQAH-RIQNIPTNRA-KGQKEKD — SSAFAAETKEAAHCDSSSCG-HG-FLQTTSKFHE 
WSFC-FNSGINTHDDHTRQMGYVNVFAIPFTIHSLLIjCRMNSRN-TAQVGLVNFNIjT-QS 
LINV-H-GGLERATTFSSRPRGVRSRVQ-IMLGRAAYMEEP-CVKLILWLSPCDFNSFL 
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FIGURE 25 

PTPRTLDAVCGCINVFKRVCGVSAARLTPCGTGTSTDWyRAFDIYNEKVAGFAKPLKTNCCRFQE 

KDEEG^^JLDSYFVVKRHTMSNYQHEETIYmlVKDCPAVAVHDFFKFRVDGDMVPHISRQRLTKYTM 

ADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTV 

QFCDAMRDAGIVGVLTLDNQDIiNGNWYDFGDFVQVAPGCGVPrVDSYYSLLMPILTLTRALAAESH 

MDADLAKPLIKWDLLKYDFTEERLCLFDRYFKYWDQTYHPNCINCLDDRCILHCANFNVLFSTVFP 

PTSFGPLVRKIFVDGVPFWSTGYHFRELGWHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLL 

UDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFmSSSVELKHFFFAQDGNAAISDYD 

YYRYNLPTMCTIRQLLFVVEVVDKYFIXrjroGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMS 

YEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGAT 

VVIGTSKFYGGWHNMLKTVYSDVETPHLMGWDYPKCDRAMPNMLRIMASLVLARKHNTCCIJLSHRF 

YRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADK 

YVRNLQHRLYECLYRNRDVDHEFVDEFYAYLRKHFSMMILSDDAWCYNSNYAAQGLVASIKNFKA 

VLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKT 

DGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYW 

EPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVC 

NAPGOJVTOVTQLYLGCMSYYCKSHKPPISFPLCyUSIGQWGLYRNTCVGSDNVTDFNAIATCDWTN 

AGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVF 
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FIGURE 60 
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FIGURE 65 
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620 

STGNNVFQTQAGCLI 
TCT ACT GGA AAC AAT GTA TTC CAG ACT CAA GCA GGC TGT CTT ATA 

630 640 

GAEHVDT SYECDIPI 
GGA GCT GAG CAT GTT GAT ACT TCT TAT GAG TGC GAC ATT CCT ATT 

650 

G.AGICASYHTOC SEQ ID NO: 9799 

GGA GCT GGC ATT TCT GCT AGT TAG CAT ACA TAA TGAGTCGAC SEQ ID NO; 9800 

Translated Mol. Weight = 72525.52 
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H 
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GGT 
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TTA 
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TTT 


GTG 


TTT 
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180 
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GGG 
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TAT 
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TAT 


CAA 


CCT 
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T 
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GTT 
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TCT 
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TTT 
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ACT 


TTG 
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K 


210 
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I 
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I 


T 


N 


F 


220 
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CCT 


ATT 


TTT 
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TTG 


CCT 


CTT 


GGT 
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ATT 


ACA 


AAT 


TTT 
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T 


GCC 


ATT 


CTT 


ACA 


GCC 


TTT* 


TCA 


CCT 


GCT 
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TCA 


GCT 


GCA 


GCC 
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T 
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GTT 


AGT 


GGT 




















860 












T 


A 


T 


A 


G 


W 


T 


F 


G 


A 


G 


A 


A 


L 


Q 


ACT 


GCC 


ACT 


GCT 


GGA 


TGG 


ACA 


TTT 


GGT 


GCT 


GGC 


GCT 


GCT 


CTT 


CAA 



870 



880 




wo 2004/092360 



119/193 



PCT/US2004/011710 



I 


P 


F 


A 


M 


Q 


M 


A 


Y 


R 


F 


N 


G 


I 


G 
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GGA 
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AAC 
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CTT 
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ACA 
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CAG 
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ATC 


AGG 


GCT 










990 




















1000 


A 


E 


I 


R 


A 


S 


A 


N 


L 


A 


A 


T 


K 


M 


S 


GCT 


GAA 


ATC 


AGG 


GCT 


TCT 


GCT 


AAT 


CTT 


GCT 


GCT 


ACT 


AAA 


ATG 


TCT 




















1010 










E 


C 


V 


L 


G 


Q 


S 


K 


R 


V 


D 


F 


C 


G 


K 


GAG 


TGT 


GTT 


CTT 


GGA 


CAA 


TCA 


AAA 


AGA 


GTT 


GAC 


TTT 


TGT 


GGA 


AAG 










1020 


















1030 


G 


Y 


H 


L 




S 


F 


P 


Q 


A 


A 


P 


H 


G 


V 


GGC 


TAC 


CAC 


CTT 


ATG 


TCC 


TTC 


CCA 


CAA 


GCA 


GCC 


CCG 


CAT 


GGT 


GTT 




















1040 










V 


F 


L 


H 


V 


T 


Y 


V 


P 


s 


Q 


E 


R 


N 


F 


GTC 


TTC 


CTA 


CAT 


GTC 


ACG 


TAT 


GTG 


CCA 


TCC 


CAG 


GAG 


AGG 


AAC 


TTC 










1050 


















1060 


T 


T 


A 


P 


A 


I 


C 


H 


E 


G 


K 


A 


Y 


F 


P 


ACC 


ACA 


GCG 


CCA 


GCA 


ATT 


TGT 


CAT 


GAA 


GGC 


AAA 


GCA 


TAC 


TTC 


CCT 




















1070 










R 


E 


G 


V 


F 


V 


F 


N 


G 


T ‘ 


S 


W 


F 


I 


T 


CGT 


GAA 


GGT 


GTT 


TTT 


GTG 


TTT 


AAT 


GGC 


ACT 


TCT 


TGG 


TTT 


ATT 


ACA 










1080 


















1090 


Q 


R 


N 


F 


F 


s 


P 


Q 


I 


I 


T 


T 


D 


N 


T 


CAG 


AGG 


AAC 


TTC 


TTT 


TCT 


CCA 


CAA 


ATA 


ATT 


ACT 


ACA 


GAC 


AAT 


ACA 




















1100 










F 


V 


S 


G 


N 


C 


D 


V 


V 


I 


G 


I 


I 


N 


N 


TTT 


GTC 


TCA 


GGA 


AAT 


TGT 


GAT 


GTC 


GTT 


ATT 


GGC 


ATC 


ATT 


AAC 


AAC 
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T 


V 


Y 


D 


1110 
P L 


Q 


P 


. E 


L D 


s 


F 


K 


1120 

E 


ACA 


GTT 


TAT 


GAT 


CCT CTG 


CAA 


CCT 


GAG 


CTT GAC 


TCA 


TTC 


AAA 


GAA 


E 


L 


D 


K 


Y P 


K 


N 


H 


1130 
T S 


P 


D 


V 


D 


GAG 


CTG 


GAC 


AAG 


TAC TTC 


AAA 


AAT 


CAT 


ACA TCA 


CCA 


GAT 


GTT 


GAT 


F 


G 


D 


I 


1140 
S G 


I 


N 


A 


S V 


V 


N 


I 


1150 

Q 


TTT 


GGC 


GAC 


ATT 


TCA GGC 


ATT 


AAC 


GCT 


TCT GTC 


GTC 


AAC 


ATT 


CAA 


K 


E 


I 


D 


R L 


N 


E 


V 


1160 
A K 


N 


L 


N 


E 


AAA 


GAA 


ATT 


GAC 


CGC CTC 


AAT 


GAG 


GTC 


GCT AAA 


AAT 


TTA 


AAT 


GAA 


S 


L 


• I 


D 


1170 
I* Q 


E 


L 


G 


K Y 


E 


Q 


Y 


1180 

I 


TCA 


CTC 


ATT 


GAC 


CTT CAA 


GAA 


TTG 


GGA 


AAA TAT 


GAG 


CAA 


TAT 


ATT 


1183 

K W P OC 

AAA TGG CCT TAA 

Translated Mol. 


SEQ ZD BO: 9801 
T6AGTCGAC SEQ ID BO: 9802 

Weight = 131315. 20' 
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FIGURE 73 
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FIGURE 74 

FIGURE 74A L 
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FIGURE 75 


FIGURE 75A 
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FIGURE 79 
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FIGURE 81 




FIGURE 82 
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FIGURE 83 
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FIGURE 85D 



FIGURE 85E 



FIGURE 85F 
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FIGURE 86 
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FIGURE 87 

P65 PET 
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FIGURE 90 
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FIGURE 88 
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FIGURE 92 



FIGURE 93 
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FIGURE 94 




FIGURE 95 




FIGURE 96 





wo 2004/092360 



130/193 



PCT/US2004/011710 



FIGURE 97 



FIGURE 98 



FIGURE 99 




FIGURE 100 
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FIGURE 101 
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FIGURE 103 



FIGURE 104 
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FIGURE 106 




FIGURE 107 

FIGURE 107A FIGURE 107B 




FIGURE 108 

FIGURE 108A FIGURE 108B 
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FIGURE 109 



SI S1-S2 




INFECTED + - + _ 



FIGURE 110 

SI S1-S2 HR2 HR1 
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FIGURE 111 



S1 S2 HR2 HR1 HR1 
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FIGURE 112 
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FIGURE 113 



5 '3' Frame 1 

PKDMTYVDSSL-WVS-ITKSMVTLICLSPMKLFVTFVRGLALM-RAVMQLEMLWVLTYL 
SS-DFLQVLT— LYRLVMLTLKITQNSPELMHKPPPVSSLNILYHSCIKACPGSM-CVLR- 
YKCSVIH-KIX:QTESCSSFGRMALSLHQ-STLSRLDLKERWCVTNVQIiAFLLHQIIiMPA 
GIILWVLTMSim-LMFSSGGFTGMiSE-P-PTLPGTWKCTCGLVVML 



5 '3* Frame 2 

QRT-PT-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PT 

SPARIFYRC-LSSCTDWLC-H-K-HKIHQS-CTNLHQ-AV-TSYTTHV-RLALECSAY-D 

STNAQ-YTERIVRQSRVRPLGAWL-AYINEVLCQDWT-KNVLSV-QTCNLLFYFIRYLCL 

LESFCGF-LCL-PIYD-CSAVGALRVTFQSNHDQHCQVHGNAHVG-L-C 

5 > 3 > Frasae 3 

KGHDLRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 

LQLGFSTGVNLVAVPTGYVDTElJNTKFTRVNAQTSTSEQFKHLIPIiMYKGLPWNWRIKI 

VQMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 

WNHSVGFDYVYNPFMIDVQQWGLYG-PFRVTMTNIARYMEMHMWASCDA 

3 '5' Frame 1 

-HHN-PTCAFPCTWQCWSWLL-KVTRKAPTAEHQS-MGYRHSQNPQNDSSRHKYLMK-KS 
KLHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTILSVYH-AFVLS-YALHSRAS 
LYT-W-DV-TAHWWRFVH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-L 
HDSPLHQSQSTHERDE-LLRG— TY-GNH-LGNSF-NPS-R-VYVGHVLW 

3 '5' Frame 2 

SITTSPHVHFHVPGNVGHGYSERLPVKPPLLNINHKWVIDIVKTHRMIPAGISI--SRKA 

SCTFVTQTTRSFRSNLDKVLH-CKLKAMRPKDEHDSV-QSFQCITEHLYYLNTHYIPGQA 

FIHEWYKMFKLLTGGGLCINSGEFCVIFSVNITSRYSY-VNTCRKS-LER-VSTHSISSC 

MTALYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDEST-VMSF 

3 >5' Frame 3 

ASQLAHMCISMYLMLVMVTLKGYP-SPHC-TSIINGL-T-SKPTE-FQQA-VSDEVEKQ 

VARLSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNPFSVSLSICTILIRTTFQGKP 

LYMSGIRCLNCSLVEVCALTLVNFVLFSVST-PVGTATKLTPVENPSWRGRLVPTASLVA 

-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLRRSCPL 
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FIGURE 114 



5 '3* Frame 1 

YRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLG 

FSTGmVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI 

5*3' Frame 2 

TVDSSL-WSK-ITKSMVTLICLSPAKKLFVTFVRGLALM-RAVMQLEMLWVLTYLSS-D 
FLQVLT- -LmVMLTLKITQNSPELMQNLHQVTSLNILY 

5 >3' Frame 3 

P-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN-RCCGY-PTSPARI 

FYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYT 

3*5* Frame 1 

GIRCLNWSPGGGFALTLVNSVLFSVST-PVGTATKLTPVENPSWRGRLVPTASLVA-QPS. 

TSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLR 

V . - 

3*5' Frame 2 

V-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-LHDSPL 
HQSQSTHERDE-LLRG— TY-6NH-LGNSF-NPS-R-VYG 



3 >5' Frame 3 

Y10y[FKLVTWWRFCINSGEFCVIFSVNITSRYSY-WCRKS-LER-VSTHSISSCMTA^^ 

IKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTV 
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FIGURE 115 
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— Section 152 
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( 1 ) 1 
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..■■■ Section 153 
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(5840) 

( 1 ) 

(5929) YDNKLKAKND SSLCFKVYYKG 



TTHESSSAVNMQQ 
Section 154 



(5968) 5968 




( 1 ) 



(5968) IHLI KFLKANP W AVFISPYNSQNF AKRVLGLQTQ 
— Section 155 
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- - 







( 1 ) 
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- — Section 156 
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FIGURE 115 (contd.) 
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6319 



mm 



FIGURE 115 (contd.) 



mm 



Section 163 
6357 



(6262) 



(6358) CLAVHDCFCN VNWNLEYPIISNELSViSTTSCRLLQRVML 
Section 165 

(6397) 6397 §410 



(6397) KAAMLCNRY VCYDIGNPKGIACVK FDFKFYDANPI 
— Section 166 

(6436) 6436 



(6436) VKSVKQFLY YBAHKD F DGLCMFWNCNVDKYP NAVV 
I Section 167 

(6475) 6475 



(6475) CRFDTRVLN LNLPGCNGGSLYVNKHAFHT PFSRAAFE 
Section 168 



(6514) 6514 6029 

6332) nW ‘ - - 

5901 ) m 

6416) ML 

(229) ■ 

(6514) NLKPMPFFYYSDTPCVYMDGMDAKQVDYVPLKSATCITR 
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(6553) 65^ mm - 



FIGURE 115 (conid,) 






Section 169 
6591 




(6455) w 
(229) 



(6553) CNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFD 
— Sectfon 170 



(6592) 6592 
(6410) 
(5978) 
(6494) 

(229) 



,6600 



6610 



6870 



6630 




(6592) FYNLWNTFTKLQBLENVVYNLVKAGHYDG AGEMPCAII 



(6631) 6631 
(6449) E 
(6017) 
(6533) 

(229) 






Section 171 
6669 




(6631) GDKVIAKIQ BDVVVFINNTTFPTNVAVELFAKRSIR H 
— — Section 172 



(6670) 6670 
(6488) 




(229) 



(6670) PELKLFRNLNIDVCW HVIWDYAKDSIFCSNTYKVC YT 
— — — — Section X7Z 



(6709) 6709 




(6611) _ _ 

(229) ----- — — — ~ _ I “ I _ r.-- 

(6709) DL ID LNVLFDGRDNGALEAFKKA NGVYISTTKIKS 

^ ^ , — . .. ■ Section 174 

(6748) 67^ JS76Q 6770. 6786 






(6566) l SEQ ID NO: 10068 

(6132) Y§|^s - — -- -- SEQ ID NO: 10069 

(6650) iooto 

(229) - - - - - - - - — - - - - - — - - - - - - SEQ ID NO: 9997/98 

(6748) L SMI KG P RADLNGVVVDKVGDSD FWFAVRKDGNDVI 8EQIDNO:10071 
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FIGURE 118 (contd.) 

320 330 5 



< Section 7 
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^ wo; |.iSM f. V 

( 307 ) TSCRLLQRVMLKAAMLCNRYTVCyblGNPkGIACVK bPDFkFYbANPiv 

— • ■ -" - ■■ — r ..^ tf . , ., — — — Section 8 

(358)358 







(405) 

(409) PGCNGGSLYVNkHAFHTKPFSRAAFENLKPMPFFYYSbTPCVYMDGMDAKQ 
— Section 10 

tm 

(457) 

(456) 

(460) VDYVPLkSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKT 

i n . I I ■ , , i f ,. - II. I * . I . — Section 1 1 









■fsm -s’H : • 

SEQ ID NO: 10073 
SEQ ID NO: 10074 
SEQ ID NO: 10075 
SEQ ID NO: 10076 

(511) k DFYNLWN T FTKLQ SEQ ID NO: 10077 








wo 2004/092360 PCT/US2004/011710 

144/193 

FIGURE 119 

tagtcaaaacccacagaatgattccagcaggcataagtatctgatgaagtagaaaagcaa 

-VSDEVEKQ 

gttgcacgtTTGtcacacagacaacacgttctttcaggtccaatcTTGacaaagtacttc 
VA RJ)_S H R Q H V L S G P IJ^T K Y F 

attgatgtaagctcaaagccatgcgcccaaaggacgaacacgactctgtctgacaatcct 
I DVSSKPCAQRTNTTLSDNP 

ttcagtgtatcactgagcatttgtactatcttaatacgcactacattccagggcaagcct 
F S V S L S I C T I L I R T T F Q G K P 

ttatacATGagtggtataagatgtttaaactgctcactggtggaggtttgtgcattaact 
L Y Jl_ S G I RC LNCSLVEVCA LT 

Ctggtgaattttgtgttattttcagtgtcaacataa SEQ ID NO: 10080 
L V N F V L F S V S T - SEQ ID NO: 10027 



FIGURE 120 

FIGURE 120A 

PRHTQRT-PTVDSSL-WVSK-ITKSMVTLICLSPAKKLFVTFVRGLALM-RAVMQ 
LEMLWVLTYLSS-DFLQVLT— LYRLVMLTLKITQNSPELMQNLHQVTSLNILYH 
SCIKACPGM-CVLR-YKCSVIH-KDCQTESCSSFGRMALSLHQ-STLSRLDLKER 
WCTOJVQLAFLLHQILMPAGIILWVLTMSITHL-UyiFSSGALRWFRVTMTNIA 
RYMEMHMWLVVMLS-LDV-QSMSALLSALIGLLNTLL-EMN-GLILLAEKYNTWL 
-SLHCLLISFQFFMT-EIQRLSSVCLRLK-NGSSTMLSHWTKLTK-RNSSILML 
YITINSLMVFVCFGIVTLIVTQPMQLCVGLTQESCQT-TYQAVMVWCM-ISMHS 
TLQLSIKVHLLI-SNCLSFTILIVLVSLMANK-CRILIMFHSNLLRVLHDAI-W 
LFADTMQMSTDSTWMHII--FLLDLAYGFTNNLILITCGIHLPGYRV 

FIGURE 120B 

LGIPKGHDLP-THLYDGFQNELPSQWLP-YVYHPRRSYSSRSCVDWL-CRGLSCN 
-RCCGY-PTSPARIFYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYTT 
HV-RLALECSAY-DSTNAQ-YTERIVRQSRVRPLGAWL-AYINEVLCQDWT-KNV 
LSV-QTCNLLFYFIRYLCLLESFCGF-LCL-PIYD-CSAVGLYG-PSE-P-PTLP 
GTWKCTCG-L-CYHD-MFSSP-VLC-AR-LVC-IPYYRR-TEG-FCLQKSTTHGC 
EVCIAC— VSSSS-HRKSKGYQVCASG-SRMEVLRCSAM— QSLQNRGTLLFLCY 
TSR-IH-WCLFVLEL-R-SLPSQCNCV-V-HKSLVKLELTRL-WW-FVCE-ACIP 
HSSFR-KCIY-FKAIAFLLLF— SL-VSWQTSSVGY-LCSTQICYVYYTMQFRWC 
CLQTPCK- VPTVLGCI - YDDFCWI - PMDLQTI - YL- PVEYI YQVTEF 
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FIGURE 120C 

-AYPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVG 

TNLPLQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNW 

RIKIVQMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSD 

TYAOWNHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAV 

HECFVKRVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVP 

QAEVEWKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFD 

TRVLSNIiNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSD 

IDYVPLKSATCITRO)IIjGGAVCRHHANEYRQYLDAY]!®mSAGFSLWIYKQFI>miLWNT 

FTRLQSL 

FIGURE 120D 

-TL-PGKCIPQVISIKLFVNP-AKSSRNHHIICIQVLSVLICMVSANSTT-IASCNTRSR 

FEWNIINIRHYLFAMRLTRTIRIVKERQLL-ISKCTFIESWSVECMLIHIQTTTITAW-V 

QV-QDSCVKPTMCIGWVTIlWTIPKQTOTISEFIVMYSIRIEEFLYFVSFVTTlAILSIVE 

LPFYFSLRHTLDSLWI SYVMKNWKLISKQCRLHNHVLYFSASRINPQFI SYNRVFNRPIN 

ALNKALMTC-TSSHDSITTSHMCISMYLAMLVMVTLKVTRKAPLmiNHKWVIDIVKTHR 

MIPAGISI— SRKASCTFVTQTTRSFRSNLDKVLH-CKLKAMRPKDEHDSV-QSFQCITE. 

HLYYLNTHYIPGQAFIHEWYKMFKLVTWWRFCINSGEFCVIFSVNITSRYSY-VNTCRKS. 

-LER-VSTHSISSCMTALYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTVG 

HVLWVCL 

FIGURE 120E 

KLCNLVNVFHRL-VSNCL-IHRLNPAEIIILYASKYCRYSFAWCLQTAPPKLHRVIHVAD 
LSGT-SISDTTCLP-DSQGLSE— KKGNCFKLVNALLSKAGVWNACLFTYKLPPSQPGKF 
KFDKTLVSNLHTIALAG-RSTLQFQNKQTPSVNLS-CIA-E-KSSSIL-ALSLHG-AS-N 
FHSTSA-GTHLIAFGFPMS-RTGNLSASNADFTTMCCTFLQAELTLSSSPIIGYSTDQST 
RLTKHSWTAKHLVMIASQLATCAFPCTWQCWSWLL-RLPVKPHC-TSIINGL-T-SKPTE 
-FQQA-VSDEVEKQVARLSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNPFSVSLS 
ICTILIRTTFQGKPLYMSGIRCIiNWSPGGGFALTLVNSVLFSVST-PVGTATKLTPVENP 
SWRGRLVPTASLVA-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLR-V 
MSFGYA- 

FIGURE 120F 

NSVTW-MYSTGYKYQIVCKSIG-IQQKSSYYMHPSTVGTHLHGVCKQHHLNCIV-YT-QI 
-VEHNQYPTLLVCHETHKDYQNSKRKAIALN— MHFYRKLECGMHAYSHTNYHHHSLVSS 
SLTRLLCQTYTQLHWLGNDQRYNSKTNKHHQ-IYRDV-HKNRRVPLFCKLCHYMAEHRRT 
SILLQPEAHT— PLDFLCHEELETYQQAMQTSQPCWLFCKQN-PSVHLL— GIQQTNQR 
A-QSTHGLLNI-S— HHN-PHVHFHVPGNVGHGYSEGYP-SPTAEHQS-MGYRHSQNPQN 
DSSRHKYLMK-KSKLHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTILSVYH-A 
FVLS-YALHSRASLYT-W-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS-HL-KIL 
AGEVG-YPQHL-LHDSPLHQSQSTHERDE-LLRG— TY-GNH-LGNSF-NPS-R-VYGRS 
CPLGMPR 
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FIGURE 121 

10 20 30 40 50 60 



SEQ ID NO: 10033 TA(XGTAGACTCATCTCTATGATGGGTTKy^^ 

SEQ ID NO: 10084 CCTAGGCATACCCAAAGGACATGACCTACCGTAGACTCy^TCTCTATGATGGGTTTCAAAA 
Consensus TACXIGTAGACntATCrcTATGATGGC?^^ 

Prim. cons. CCH'AGGCATACXZCAAAGGACaTGACCTACCGTAGACTCATCnCTATGATGGGTTTCAAAA 



70 80 90 100 no 120 

I I I I I I 

SEQ ID NO: 10033 T6AATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCGTC 
SEQ ID NO: 10084 TGAATTACCAAGTC7VATGXOTACCCTAATAT(JITrAT(:ACCCGCGAAGAAGCTATTC(?rC 
Consensus TGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTATTCCTC 

Prim. cons. TGAATTACCAAGTCAATGGTTACCCTAATATCTTTATCACCCGCGiAAGAAGCTATTCGTC 

130 140 150 160 170 180 

I I I I I I 

SEQ ID NO: 10033 ACGTTCffrGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG 
SEQ ID NO: 10084 ACGTTCGTGCCTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGTGG 
Corisrasus ACGTTCffTCCGTGGATTGGCTTTGATGTAGAGGGCTCKXlATGCAACTAGAGATGCTOrGG 

Prim. cons. • ACfflTCfflXX^GTGGATrGGCTTTGATGTAGAGGGCTGTCATGCAACTAGAGATGCTGI^ 

190 200 210 220 230 240 

I III I I 

SEQ ID NO: 10033 GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 
SEQ ID NO: 10084 GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTAGCTGTACCGA 
Consensus GTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTACTAGCTGTACCGA 

Prim. cons. GOACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTACTAGCTGTACCGA 

250 260 270 280 290 300 

I I I I I I 

SEQ ID NO: 10033 CTG(?raATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 
SEQ ID NO: 10084 CTGGTTATOTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCAC 
Consensus CTGGTTATGTTGACACTGAAAATAACACA6AATTCACCAGAGTTAATGCAAAACCTCCAC 

Prim. cons. CTGGTTATGTTGACACTGAAAATAACACA6AATTCACCAGAGTTAATGCAAAACCTCCAC 

310 320 330 340 350 360 

I I I I I I 

SEQ ID NO: 10033 CAGGTGACCAGTTTAAACATCTTATACC 

SEQ ID NO: 10084 CAGGTGACCAGTTTAAACATCTTATACCACTCATCTATAAAGGCTTGCCCTGGAATGTAG 
Consensus CAGGTGACCAGTITAAACATCTTATACC 

Prim. cons. CAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATGTAG 



etc. 
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FIGURE 122 

5 ' 3 ' Frame 1 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaa 
PRHTQRT-PTVDSSL - WVSK 
tgaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtc 
-ITKSMVTLICLSPAKKL FV 
acgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgg 
TFVRGLALM- .RAVMQLEMLW 
gtactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccga 
VLTYLSS-DFLQVLT--LYR 
ctggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccac 
LVMLTLKITQNSPEL MQNLH 
caggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtag 
QVT SLN I LY HSC I KAC PGM - 
tgcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcg 
C VLR-YKCSVIH-KDCQTES 
tgttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattg 
C S S F G RMAL S L H Q - S T L S R L 
gacctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcag 
DLK ERVVCVTNVQLAFLLH Q 
atacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatga 
ILMPAGI I.LWVLTMS ITHL - 
ttgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcc 
LMF S S GALRVT FRVTMTN’I A 
aggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcag 
R YMEMHMWLVVML S - LDV - Q 
tccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatg’ 
SMSALLSALIGL LNTLL-EM 
aactgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgc 
N-GLILLAEKYN TWL - SLHC 
ttgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgc 
LLISFQFFMT-EIQRLSSVC 
ctcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttaca 
LRLK-NGSSTMLSHVVTKLT 
aaatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgttt 
K-RNSSILMLYITINSLMVF 
gtttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttg 
VCFGIVTLIVTQPMQLCVGL 
acacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtga 
TQESCQT-TYQAVMVVVCM- 
ataagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaat 
ISMHSTLQLSIKVHLLI-SN 
tgcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgg 
CLSFTILIVLVSLMANK-CR 
atattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtg 
ILIMFHSNLLRVLHDAI-VV 
ctgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatga 
LFADTMQMSTDSTWMHI I - - 

tttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaata 
FLLDLAYGFTNNLILITCGI 
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catttaccaggttacagagttta 
H L P G Y R V 



5*3 ' Frame 2 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaat 
LGIPKGHDLP-THLYDGFQN 
gaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtca 
EL PSQWLP-YVYHPRRSYSS 
cgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtggg 
RSCVDWL - CRGLS CN - RCCG 
tactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgac 
Y-PTSPARIFYRC-LSSCTD 
tggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacc 
W L C - H - K - HR I H Q S - C KT S T 

aggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagt 
R-PV-TSYTTHV-RLALECS 
gcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgt 
A Y - D S T N A Q - YTERIVRQSR 

gttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgg 
VRP L GAWL - AYI NEVLCQDW 
acctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcaga 
T-KNVLSV-QTCNLLFYFIR 
tacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgat 
YLCLLESFCGF-LCL-PIYD 
tgatgttcagcagtggggctttacgggtaaccttcagagtaacce.tgaccaacattgcca 
-CS AVGLYG-PSE-P-PTLP 
ggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagt 
GTW.KCTCG-L-C YHD-MFSS 
ccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatga 
P-VLC- AR-LV 'C-IPYYR R- 
actgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgct 
TEG-FCLQ KSTTHGCEVCIA 
tgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcc 
C--VSSSS-HRKSKGYQVCA 
tcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaa 
SG-SRMEVLRCSAM - - QSLQ 

aatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttg 
NRGTLLFLCYTSR-IH-WCL 
tttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttga 
FVL EL -R-SL PSQCNCV-V- 
cacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaa 
HKSLVKLELTRL-WW-FVCE 
taagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaatt 
-ACIPHSSFR-KCIY-FKAI 
gcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgga 
AFLLLF - - SL-VSWQTSSVG 

tattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgc 
Y-LCSTQICYVYYTMQFRWC 
tgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgat 
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CLQTPCK-VPTVLGCI-YDD 
ttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaatac 
FCWI-PMDLQTI-YL-PVEY 
atttaccaggttacagagttta 
I Y Q V T E F 



5*3* Frame 3 

cctaggcatacccaaaggacATGacctaccgtagactcatctctatgatgggtttcaaaatg 
-AY PKDM TYRRLISMMGFKM 
aattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtcac 
NYQVNGYPNMF ITREEAIRH 
gttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgggt 
VRAWIGFDVEGCHATRDAVG 
actaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgact 
T NLPLQLGFSTGVNLVAVPT 
ggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacca 
GYVDTENNTEFTR VNAKPPP 
ggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagtg 
G DQFKHL I PLMYKGL PWNVV 
cgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgtg 
RIKIVQML .S'DTLKGLSDRVV 
ttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgga 
fvlwahg'felt SMKYFVKIG 
cctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcagat 
PERTCC LCDK RA T CF STS SD 
acttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgatt 
TYACWNHSVG FDYVYNPFMI 
gatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgccag 
DV QQWGFTGN LQSNHDQHCQ 
gtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagtc 
VHGNAHVASCOAIMTRCLAV 
catgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatgaa 
HECFVKRVDWSV EYP I IGDE 
ctgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgctt 
LRVNSACRKVQHMVVKSAL L 
gctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcct 
ADKFPVLHDIGNPKAIKCVP 
caggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaaa 
QAEVEWKFYDAQPCSDKAYK 
atagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttgt 
lEELFYSYAIHHDKFTDGVC 
ttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttgac 
LFWNCNVDRYPANAIVCRFD 
acaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaat 
TRVL SN LNLPG C DGG SLYVN 
aagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaattg 
KHAFHTPAFDKSAFTNLKQL 
cctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcggat 
PFFYYSDSPCESHGKQVVSD 
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attgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgct 
IDYVPLKSATCITRCNLGGA 
gtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgatt 
VCRHHANEYRQYLDAYNMMI 
tctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaataca 
SAGFSLWIYKQFDTYNLWNT 
tttaccaggttacagagttta 
F T R L Q S L 



3 '5' Frame 1 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaat 
-TL-PG KCIPQVISIKLFVN 
ccataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactc 

P - AKS SR NHH I I C I QVL SVL 

atttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcaga 
ICM VSANSTT - lASCNTRSR 
tttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggact 
F EWNI INIRHYLFAMRLTRT 
atcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagc 

I RI VK ERQLL - I SKCTF I ES 

tggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagtt 
WSVECMLIHIQTTTITAW-V 
caagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatc 
QV-.QDS CVKPTHNC IGWVT I 
aacgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcata 
NVT.IPK QTNTISEFIVMYSI 
agaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaa 
RIEEFLYFVSFVTTWLSIVE 
cttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatg 
LPFY FSL RHTLDSLWISYVM 
aagaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttct 
KNWKL I S KQCRLH NHVLY F S 
gcaagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaac 
ASR.INPQFI SYNRVFNRP IN 
gcgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagc 
ALNKAL. MDC-TSSHDSITTS 
cacatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgt 
HMC I SMYL AMLVMVTLKVTR 
aaagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacaga 
KA PLLNINHKWVIDIVKTHR 
atgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcaca 
MIPAGISI--SRKASCTFVT 
cagacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaa 
QTTRSFRSNLDKVLH-CKLK 
gccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgag 
AMRPKDEH DSV - Q S FQC I TE 
catttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtat 
HLYYLNTHYIPGQAFIHEWY 
aagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtt 
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KMFKLVTWWRFC INSGEFCV 
attttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcc 
IFSVNITSRYSY-VNTCRKS 
tagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatc 
-LER-VSTHSISSCMTALYI 
aaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagg 
KANPRTNVTNSFFAGDKHIR 
gtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggt 
VTIDLVIHFETHHRDESTVG 
catgtcctttgggtatgcctagg 
H V L W V C L 



3 • 5 • Frame 2 

taaact.ctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatc 
KLCNLVNVFHRL - VSN CL - I 
cataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactca 
HRLNPAEIIILYASKYCRYS 
tttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagat 
FAWCLQTAPPKLHRVIHVAD 
ttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggacta 
L S G.T - S I S DT T C L P - D S Q GL 

tcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagct 
SE - - KKGNCFKLVNALLSKA 

ggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttc 
GVWNACLFTYKLPPSQPGKF 
aagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatca 
KFDKTLVSNLHTIALAG-RS 
acgtcacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataa 
TLQFQNKQTPSVNLS-CIA- 
gaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaac 

E KSSSIL ALSLRG — AS — N 

ttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatga 
FHSTSA-GTHLIAFGFPMS- 
agaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctg 
RTGNLSASN ADFTTMCC, TFL 
caagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacg 
QAELTLSSSPIIGYST DQST 
cgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcc 
RLTKHSWTAKHLVMIASQLA 
acatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgta 
TCAFPCTWQCWSWLL-RLPV 
aagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaa 
KPHC-TSIINGL-T-SKP TE 
tgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacac 
-fqqa-vsdevekqvarlsh 
agacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaag 
RQHVLSGPILTKYFIDVSSK 
ccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagc 
PCAQRTNTTLSDNPFSVSLS 
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atttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtata 
ICTILIRTTFQGKPLYMSGI 
agatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtta 
RCLNWSPGGGFALTLVNSVL 
ttttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcct 
FSVST - PVGTATKLTPVENP 
agctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatca 
SWRGRL VPTASLVA-QPSTS 
aagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattaggg 
KPIHART - RIASSRVINILG 
taaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtc 
-PLT W-FILKPIIEMSLR-V 
atgtcctttgggtatgcctagg 
M S F G Y A - 



3*5' Frame 3 

taaactctgtaacctggtaaatgtat'tccacaggttataagtatcaaattgtttgtaaatcc 
N SV TW - MY S TGYKYQ IVCK S 
ataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactcat 
IG- I QQKSSYYM HPSTVGTH 
ttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagatt 
LHGV CKQHHLNC IV - YT - Q I 
tgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggactat 
-VSHNQYPTLLVCHETHKDY 
cagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagctg 
Q NS KRKAIALN - - MHFYRKL 

gagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttca 
ECGMHAYSHTNYHHHSLVSS 
agtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatGaa 
SLTRLLCQTYTQLHWLGNDQ 
cgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataag 
RYNSKTNKHHQ-IYRDV-HK 
aatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaact 
NRRVPLFCKLCHYMAEHRRT 
tccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatgaa 
SILLQPEAHT--PLDFLCHE 
gaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctgc 
ELETYQQ AMQTSQPCVVLFC 
aagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacgc 
KQN-PSVHLL--GIQQTNQR 
gcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcca 
A-QSTHGLLNI-S--HHN-P 
catgfegcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgtaa 
HVHFHVPGNVGHGYSEGYP- 
agccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaat 
SPTAEHQS-MGYRHSQNPQN 
gattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacaca 
DSSRHKYLMK-KSKLHVCHT 
gacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaagc 
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DNTFFQVQ S-QSTSLM-AQS 
catgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagca 
HAPKGRTRLCLTILSVYH-A 
tttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtataa 
FVLS-YALHSRASLYT-VV- 
gatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgttat 
DV-TGHLVEVLH-LW-ILCY 
tttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatccta 
FQCQHNQSVQLLS-HL-KIL 
gctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatcaa 
AGEVG-YPQH L-LHDSPLHQ 
agccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagggt 
SQSTHERDE-LLRG--TY-G 
aaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtca 
NH-LGNSF-NPS-R-VYGRS 
tgtcctttgggtatgcctagg 
C P L G M P R 
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FIGURE 123 



CCTAG(X:ATACCCAAAGGACATGACCTA(XmGACrcATCTCTATGATGGGTTTCAAAAT^ 

i .N..N i N.... 

TAC(X:TAATAT6TTTATCACCCBCGAAGAAGOTATTCGTCAa3TTCGT(X:GTGGATTGGCTimTGTAGAGG^ 

i ,..N N 

T(mm6AGAT6CTGTGGGTACTAA(X:TACCT<n€CAGCTAGGATTTTCTACAGGTGTTAACm6TAGCTCTAa 

N 

CnGGTTATGTT6ACACTGAAAATAA(mCAGAATTCACCAGAGTTAATGa^CCTCCACCAG^^ 

N N 

OTATACCACTCAT6TATAAAGGCTTGO:CTGGAATGTAGTGCGTATTAA6ATAGTA(yiAATGCTCAGT6^ 

N N N 

AGGATTGTCAGACAGAGTOSTGTTCGTCCTTTGGGaxyVTGGCTTmGCOTACATC^ 

N N 

GACCTGAAAGAAC6TGTTGTCTGTGT<aCAAACGTGCAACTTGCTTTT^ 

N 

CATTCTGTG(mTTGACTATGTCTATAACC(y^TmTGATTGATGTTCAG(a^GTGGGG(^ 

N N N 

TAACCAT(acma^TTGCCAGGTA(yiTGGAAATGaLCATGTGGCTAGTTGmTGCTATCATGACTA6ATGTTTAG(yiG . 

N N N N N i N 

TCCATGAGTGCTTTCTTAAGCGCGTTGATTGGTCTGTTGAATACCCTAmTAGGAGATGAACTGAGGGTTAATTCTGOT • 

...N N 

TGCAGAAAA6TACAACACATGGTTGTGAAGTCTGCATTGCTTG(nGATAAGTTrcCAGTTCTTCATGACATAGGAAATC^ 

i N.. 

AAAGGCTATCAAGTGTGTGCCTCAGGCimGTAGAATGGAAGTTCTACGATGCTCAGCCATGTAGTGACAA^ 
N N N 

N N 

GTmTCGTTACCCAG(mTG(yATTGTGTGTAGGTTTGACA(aAGAGT(TOTCAAACT^^ 

N ,...N 

TGGTGGTAGTTTGTATGTGAATAAGCATG(aiTTCCACACTCCAGOTTTCGATAAAAGTGCAm 

N N 

TGCCTTTCTTTTACTATPCTGATAGTCCITCTGAGTCmTGGCAAAC^^ 

i.... N 

AAATCTGCTACGTGTAmCACGATGCAAmAGGTGGTG(nGTTTGCAGACaCCATGCAAATGAGT^^^ 

N N N 

GGATGCATATAATATGATGATTTCTGCIGGATTTAGCCTATGGATTTACaAACAATTIGATACITATAAC 

..N N..N N 

CATTTACCAGGTTACAGAGTTTA SEQ ID KO; 10084 
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FIGURE 123 (conid.) 



Pos 


Score 


2X 


0 . 651 


45 


0.354 


48 


0.387 


60 


0 . 590 


76 


0 . 470 


90 


0.676 


145 


0 . 192 


160 


0 . 410 


172 


0 .290 


247 


0.221 


286 


O . 219 


333 


O . 373 


355 


0.178 


381 


0 .286 


439 


0 . 405 


459 


0 .204 


547 


0 .289 


580 


O . 447 


597 


O . 449 


604 


O .290 


646 


0 . 427 


667 


0 . 427 


673 


0.208 


679 


0.317 


694 


0.180 


702 


0 . 554 


710 


O . 151 


724 


0.384 


778 


O . 151 


819 


0 . 711 


865 


0.306 


917 


0.230 


931 


0.214 


941 


0 . 190 


985 


0 . 274 


1012 


0.368 


1060 


0.206 


1120 


0 . 193 


1135 


0 . 185 


1147 


0 . 431 


1240 


0 . 562 


1270 


0 . 377 


1304 


0 . 190 


1336 


0.353 


1342 


0.312 


1363 


0 .213 


1374 


0.178 


1377 


0 . 096 


1400 


0 . 056 



Pared 

Yes 

Yes 

Yes 



Yes 

Yes 



Yes 
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FIGURE 124 



Sequences : 



gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 

gi 



74827 |pir I jVFIHJH genome polyprotein lb - murine hepatit. 
14917044|sp|P29982|RRPB_CVMJH RNA-directed ENA polymeras. 
26007546|ref |NP_068668.2| ORFlab polyprotein [Murine hep. 
7769342 gb AAF69332.1 AF208066_2 RNA-directed RNA polyme. 
6625761 gb AAF19384.1 AF201929_2 RNA-directed RNA polyme. 
2641128 gb AAB86818.1 RNA-directed RNA polymerase [muri. 
4377413 emb|CAA36202.l| open reading frame lb (AA 1-2733. 
133592|sp|P16342|RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE . 

ref |np_ 150073 .2 1 orf lab polyprotein [Bovine cor. 



26008080 

15077820 

18033972 



gb 

gb 



AAK83365.1 

AAL57305.1 



replicase [bovine coronavirus] 
replicase [bovine coronavirus] 



7769353|gb|AAF69342.l|AF208067_2 RNA-directed RNA polyme. 
17529672 gb|AAL40397.l|AF220295_2 RNA polymerase lb [bov. 
25121571 ref NP_740618.'l coronavirus nspll [Murine hepa. 

26008092 ref NP_742140.1 coronavirus nspll [Bovine coro. 

10242469 ref NP_066134.1 ORFlab polyprotein; frameshift. 
14149033 emb CAC39112.l| replicase polyprotein lab [Avia. 
458735 @iib|CAA83018.l| potential chimeric protein [Avian. 
133594 sp|P26314|RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (. 
29293454 |gb|AAO67706.1 1 ORFlb polyprotein [Avian inf ecti. 

ref | np_ 740631.1| coronavirus nspll [Avian infec. 

replicase [Transmissible gastro. 
Poll [porcine epidemic diarrhe. 
replicase polyprotein lab [Hum. 
13359l|sp|P18458|RRPB_BEV RNA-directed RNA polymerase (0. 
151306l|dbj | baa 13323.1| cyanoprotein alpha subunit precu. 



251215551 
9635157 1 ref |NP_058422 . 1 1 



19387582 

12175747 



ref 

ref 



NP_598309.1 

NP073549.1 



Alignments 



(bits) Value 



638 


0.0 


637 


0.0 


637 


0.0 


637 


0.0 


637 


0.0 


635 


0.0 


634 


0.0 


634 


0.0 


633 


e-180 


633 


e-180 


633 


e-180 


633 


e-180 


623 


e-177 


622 


e-177 


617 


e-175 


575 


e-163 


575 


e-163 


570 


e-161 


570 


e-161 


565 


e-160 


559 


e-158 


545 


e-153 


541 


e-152 


535 


e-151 


50 


8e-05 


35 


3.7 



>gi|74827|pir| |VFIHJH genome polyprotein lb - murine hepatitis virus 
(strain JHM) 

Length =2731 

Score = 638 bits (1645), E:qpect = 0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGPDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGPDAEGAHATRDSIGTNFPLQ 1644 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
Sbjct: 1645 LGFSTGIDFWEATGMFAERDGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDWRIRIVQ 1704 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA PELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSWLVTWAASFELTCLRYFAKVGKEWCSVCNKRATCFNSRTGYYGCWR 1764 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++rM-QQW6+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMrRCLAVHDCFCK 1824 
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Query: 246 RVDWSVEYPIIGDELRVNSAOIKVQHMVVKSAIjLADKFPVLHDIQNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIG!NPK + CV ++ 
Sbjct: 1825 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1882 

Query: 306 KFYDAQPCSDIOira:EBI,FYSYAIHHDKFTIX3VCLFWN(3JVDRYPANAIVaOT7n^^ 365 
KFYDA P +++ Y Y H D+P IXS+C+FWNOIVmYPAia+VairoTR^ 

Sbjct: 1883 KPYDASPV---VKSVKQFVYKYEAHKDQFLIX5LaiFMCNVDKYPANAWCRFiyiRVLSK 1939 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 

Sbjct: 1940 LNLPGCNGGSLYVNKHAFHTNPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 1999 

Query: 426 KSATCITRCNLGGAVOfflHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLVmTTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITRCmGGAVajKHAEEYREYLESYOTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2059 

Query: 486 L 486 
L 

Sbjct: 2060 L 2060 



>gi I 14917044 ) sp | P29982 | RRPB_CVMJH RNA-directed RNA polymerase (ORFlB) 
gi|7583321|gb|AAA46458.2| open reading frame lb [murine hepatitis virus] 
Length = 2731 

Score = 637 bits (1644), Expect = 0.0 

Identities = 287/481 (59%), Positives =‘366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGPKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHATRDSIGTNFPLQ 1644 

Query: 66 I/3FSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPIiMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
Sbjct: 1645 LGFSTGIDFWEATGMFAERDGYVFKKAAARAPPGEQFKHLVPLMSRGQKWDWRIRIVQ 1704 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D "W V WA FELT ++YF K+G E C +C+KRATCF+4 + Y CW 
Sbjct: 1705 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWR 1764 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMrRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DADTTRCLAVH+CF K 
Sbjct: 1765 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1824 

Query: 246 RVDWSVBYPIIGDELRVNSACI^QHMVVKSALLADI^PVLHDICaJPKAIKCVPQAEVEW 305 
V+W++BYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVNWNLEYPIISNBVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG— YDP 1882 

Query: 306 KFYDAQPCSDKAYIQEELFYSYAIHHDKFTIXlVCLFW^aNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D4F DG+C+FWNCNVD+YPANA+VCRFDTRVLS 

Sbjct: 1883 KPYDASPV VKSVKQFVYKYEAHKDQFLDGLCMFWNCNVDKYPANAWCRPDTRVLSK 1939 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 

Sbjct: 1940 LNLPGCNGGSLYVNKHAFHmPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 1999 
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Query: 426 KSATCITROtLGGAVOfflHANEYRQYIiDAYNMMISAGFSIjSnYKQFDTYNLWNTFTRIjQS 485 
+SATCITRGNLGGAVC HA EYR+YL++YN +AGF+ W+YK FT) YNLWNTFTRLQS 
sbjct: 2000 RSATCITRasnjGGAVCLKHAEEYREYLESYOT'ATTAGFTFWVYKTFDFYNLWNTFTRIKJS 2059 

Query: 486 L 486 

L 

Sbjct: 2060 L 2060 



>gi|26007546lref |NP_068668.2| ORFlab polyprotein [Murine hepatitis virus] 

Length = 7178 

Score = 637 bits (1644), Expect =0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++6Y +PITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 6032 VTYSMjISIMSFHiDLTLDGYCiaiFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPIjQ 6091 

Query: 66 IfiFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 6092 LGFSTGIDFWEATGMFAERD6YVFKKAAARAPPGEQFKHLIPLMSRGQKWDWRIRIVQ 6151 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 6152 MLSDHIADLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR 6211 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMrRCLAVHECFVK 245 
HS DY+YNP ++IHQQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVHH-CF K 
Sbjct: 6212 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSnAIMrRCLAVHDCFCK 6271 

Query: 246 RVDWSVEYPIIGDMJIYNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 6272 SVNWMLEYPIISNEVSVNTSOlLLQRVMFRAAMLCimYDVCYDIGNPKGLACVKG--YDF 6329 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLPWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 6330 KFYDASPV VKSVKQFVYKYEAHKDQFLDGLCMFWNOIVDKYPANAWa^DTRVLNK 6386 

Query: 366 LNLPGCDGGSLYVMKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 

Sbjct: 6387 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 6446 

Query: 426 KSATCITROsn^GAVCRHHANEYRQYLDAYMMMISAGFSLVniYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct; 6447 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 6506 

Query: 486 L 486 
L 

Sbjct: 6507 L 6507 

>gi [7769342 lgblAAF69332.1 1 AF208066_2 RNA-directed RNA polymerase [murine 

hepatitis virus] 

Length = 2732 
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Score = 637 bits (1644), Expect =0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYIUdilSMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPIjQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1586 VTYSRLISmGPKIJDLTLIX3YCia.FITRDEAIRRVRAWVGFDAEGAHATRDSIGTNPPLQ 1645 

Query: 66 LGFSTGVNLVAVPTGYVDTEIOTEFTRVNAKPPPGDQPKHLIPIiMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
Sbjct: 1646 LGFSTGIDFVVEATGMFAERDGYVFKKAYARAPPGEQFKHLVPLMSRGQKWDVVRIRIVQ 1705 

Query: 126 MLSDTLKGl^DRVVFVLWAHGFELTSMKYFVKEGPERTCCLCDKRATCFSTSSDTyACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1706 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEVVCSVCNKRATCFNSRTGYYGCWR 1765 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCnAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1766 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDLICSVHKGAHVASSDAIMTRCLAVHDCFCK 1825 

Query: 246. RVDWSVEYPIIGDELRVNSACRKVQHMWKSAIiLADKFPVLHDIGNPKAIKCVPQAEVEW305 
V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1826 SVNWSLEYPIISNEVSVNTSCRLIiQRVMFRAAMriCNRYDVCYDIGNPKGIiACVKG--YDF 1883 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 • 
KFYDA P +++ Y Y H D+F DG+C+FWNa^VI>^YPANA+VCRFDTRVL+ 

Sbjct: 1884 KFYDASPV VKSWQFVYKYEAHKDQFUXaCMFMCNVDKYPANAWCRFDTRVLNK 1940 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC4GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC • +DYVPL 

Sbjct: 1941 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2000 

Query: 426 KSATCITROJLGGAVCRHHANEYRQYIiDAYNMMISAGFSIiWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNI 1 GGAVC HA +YR+YIH-+YN +A6F+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2001 RSATCITRCaniGGAVCTiKHAEDYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2060 

Query: 486 L 486 
L 

Sbjct: 2061 L 2061 , 



>gi|662576ljgb|AAF19384.l|AF201929_2 RNA-directed RNA polymerase [murine 
hepatitis virus strain 2] 

gi|7739595jgb|AAF68920.l|AF207902_2 RNA-directed RNA polymerase [murine 
hepatitis virus strain ML-11] 

Length = 2733 

Score = 637 bits (1643), Expect = 0.0 

Identities = 287/481 (59%) , Positives = 366/481 (76%) , Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIRRVRAWVGFDAEGAHATRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
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Sbjct: 1647 IX^ST6IDFVVEATGMFAERIX3YVFKKAVARAPPGEQPKHLVPLMSRGQKWDVVRIRIVQ 1706 

Query: 126 MLSimiKGLSDRVVFVLWMIGFELTSMKYFVKIGPERTCa^KRATCPSTSSiyiTAaflN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCa^YFAia/GKEVVCSVCmRATCFNSRTG^ 1766 

Query: 186 HSVGFDYVYOTFMIDVQQWGFTQ^LQSNHDQHCQVHGMAHVASOTAIMrRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQC^GYTGSLTSNHDLICSVHKGAHVT^SDAIMTRCLAVHDCFCK 1826 

Query: 246 RVTMSSVBYPIIGDELRVNSAOlKVQHMVVKSALLADKFPVIiHDIGNPKAIKCVPQAEVEW 305 
V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWSLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG— YDF 1884 

Query: 306 KFYDAQPCSDKAYKIEEIJYSYAIHHDKFTDGVCn^FWNaJVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNaNVD+YPANA+VC3^FDTR\^^ 

Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKDQFl^LQ!FWNCNVDKyPANAWCRFDTRVLNK 1941. 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNIiKQLPPFYYSDSPCESHGKQWSDIDYVPL 425. 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 

Sbjct: 1942 LNLPG(3TGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMBSKQVDYVPL 2001 

Query: 42.6 KSATCITRCNLGGAVaiHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCinjGGAVa^KHAEDYREYLBSYNrATTAGFTFWVYKTFDFYNLVJNTFTRLQS 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 

>gil2641128|gb|AAB86818.l| RNA-directed RNA polymerase [murine hepatitis 
virus] 

Length = 2733 

Score = 635 bits (1637), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGPSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 1647 LGFSTGIDFWEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDWRIRIVQ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLADLADSWLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 1766 

Query: 186 HSV6FDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHWASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVBYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++BYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
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Sbjct: 1827 SVNWNLEYPIISNEVSVOTSCrajIiQRVMETlAftMLa)RYDVCYDIGNPKGLACVKG--YDP 1884 

Query: 306 KFYIAQPCSDKAYKIBELFYSYMHHDKFTIXSVCLFWNCNVDRYPMAIVCRFDTR^ 365 
KFYDA P +++ Y Y H D+F IXS+C+FMOJVIHYPANAWCRFDTO 

Sbjct: 1885 KFYDASFV^--VKSVKQFVYKYEAHKDQPLIX3IOIFWNCNVDKYPi^WaiFI)T^^ 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 

Sbjct: 1942 l^PGCWGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2001 

Query: 426 KSATCITRCNLGGAVOIHHMIEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWOTTTPRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCmGGAVOiKHABEYREYI^SYOTATTAGPTBWYKTFDFYNLWNTFTRIiQS 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi 1 4377413 |emb|CAA36202 -l| open reading frame lb (AA 1-2733) [Murine 
hepatitis virus] 

Length = 2733 

Score - 634 bits (163.6), E:^ect = 0.0 

•Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5'/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA PJH+GTN PLQ 
Sbjct: 1587 VTYSPJ.ISLl®3FKIiDLTLDGYCKIiFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 IXSFSTGVNLVAVPTGYVDTENOTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
.LGFSTG4+ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 1647 LGFSTGIDFWEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDWRIRIVQ 1706 

Query: 126 MLSimJCGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCXILCDKRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHOJAHVASCDAIMTRCTiAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RVDWSVEYPII6DELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG— YDF 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWMCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNQIVD+YPANA+VCRFDTRVL+ 

Sbjct: 1885 KFYDASPV— VKSVKQPVYKYEAHKDQFLDGLCMFWNCNVDKyPANAWCRFDTRVLNK 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PPFYYSD+PC +DYVPL 

Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFBNLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2001 

Query: 426 KSATCITRO«LGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
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Sbjct: 2002 RSATCITRattiGGAVOjKHfiEEYREYLESYNTATTAGFTFWVYRrFDFYNLWNTFTRLQS 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 

>gi 133592 1 sp P16342 |RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE (ORFlB) 
gi 93916 |pir |S15760 genome polyprotein - murine hepatitis virus (strain 
A59) 

Length = 2733 

Score = 634 bits (1636), Esspect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 ■^^ITRRLISM^K5FKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGC^^ATRImVG'^NLPLQ 65 
+TY RLIS+MGFK++ ++GY +EITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLIX5YCaa[jFIlTO5EAIKRVRAIfM3FDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 LGFSTGVNLVAVPTGYVraENNTEFTRVNAKPPPGDQF^IPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 1647 LGFSTGIDFWEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDWRIRIVQ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSWLVTWAASFELTCLRYFAKVGREWCSVCTKRATCFNSRT6YYGCWR 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
,HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSO)YLYNPLIYDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRaAVHDCFCK 1826 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTIX5VCLFVmajVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F DG+C+FWNOJVIHYPAimWCRFDTRV^^ 

Sbjct: 1885 KFYDASPV VKSVKQFVYKYEAHKDQFLIX3La«F»maJVDKYPANAWCRFI^ 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAPniLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC4GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 

Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYME(a4BSKQVDYVPL 2001 

Query: 426 KSATCITROnJGGAVCRHHANEYRQYLDAYNMMISAGFSLVrrYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCmGGAVOiKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi|26008080|ref |NP_150073.2| orflab polyprotein [Bovine coronavirus] 

Length = 7094 

Score = 633 bits (1633), Expect = e-180 
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Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTmLISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FIT+EEA++ VEAW+GPD EG HATRD++CTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLIXIYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQPKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 6008 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 6067 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCWLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDABFrRCLAVHECFVK 245 
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRIO/QHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 6188 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV--KDFDF 6245 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCagVDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct: 6246 KFYDAQPI VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAWCRFDTRVLNN 6302 

Query: 366 LNLPGCDGGSLYVNKHAFirrPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
■ LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 

Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL 6362 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
. KSATCITRCNLGGAVC HA BYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCm.GGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTKLQS 6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi I 15077820 |gb|AAK83365.1 I replicase [bovine coronavirus] 

Length = 7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVN6YPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+1K3FK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEPTRVNAKPPPGDQPKHLIPLMYKGLPWNWRIKIVQ 125 
LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 6008 LGFSTGIDPWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPIMRGQRWDWRPRIVQ 6067 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCWLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 6127 
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Query: 186 HSVGFDYVYNPFMIDVQQWGFimiI.QSNHDQHCQVHGNAHVASCDAIMrRCIiAVHECFVK 245 
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCI 1 AV++CF 
sbjct: 6128 HSVTCDYLYNPLrVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIICTRCLAVYDCFCN 6187 

Query: 246 RVDWSVBYPIIGDELRVNSACRKVQHMWKSAIiLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
++W+VEYPII +EL +N++01 +Q +++K+A+L +++ + +DI(aJPKAI CV + ++ 
Sbjct: 6188 NINWNVEYPIISNELSIOTS(niVLQRVMIiKAAiniaroYTLCYDIGWPKAIACV--KDFDF 6245 

Query: 306 KFYDAQPCSDKAYKIEBLFYSYAIHHDmiDGVCIiFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRPDTRVL+N 

Sbjct: 6246 KFYDAQPI VKSVmJ^YSFEAHKDSFKDGLQIFWNaJVDIGrPPmwa^FDT^ 6302 

Query: 366 LNLPGCTGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLP6C+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 

Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL 6362 

Query: 426 KSATCITRCWLGGAVCTffiHANEYRQYLDAYNMbaSAGFSLWIYKQFm^LSJNTFTRLQS 485 
KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCinjGGAVCI.KHAEEYREYIiESYNTATTAGFTFWVYKTFDFYNIiWNTFTKLQS 6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gill8033972 |gb|AAL57305.l| replicase [bovine corcnavirus] 

Length = 7094 

Score = 633 bits (1633), Expect = e-180 

Identities .= 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 - MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW40FD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 6007 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 6008 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPUOTRGQRWDWRPRIVQ 6067 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
M +D L LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCWLVTWAANFBLTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DADfrRCLAV++CF 
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVBW 305 
++W+VBYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 6188 NINWNVEYPIISNELSINTSCRVIjQRVMLKAAMLCNRYTLCYDIGNPKAIACV--KDFDF 6245 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFV«NCNVDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct: 6246 KFYDAQPI— VKSVKTLLYSFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVLNN 6302 
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Queiy: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAPINLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PPFYYSEH-PC +DYVPL 

Sbjct: 6303 IJOT^PGCWGGSLYVNiaiAFHTKPPSRMFEHLiaPMPFFYYSDTPCVYMIXSMDAKQTO 6362 

Query: 426 KSATCITRQJLGGAVaUfflANEYRQYIJ)AYNMMISAGFSLWIYKQFDTYNLMTFTRIiQS 485 
KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCNLGGAVOjKHAEEyREYI^SYNTATTAGFTFWVYrayDFYNLWNTFTia^S 6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gil7769353 |gb|AAF69342.l|AF208067_2 RNA-directed RNA polymerase [murine 
hepatitis virus] 

Length =2733 

Score = 633 bits (1633), Expect = e-180 

Identities = 285/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYKUJISMMGFKMNYQWGYPNMFITREEAIIUH/RAWIGFDVEGCHATRDAVGTNLPIiQ 65 
++Y RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VSYSRLISLMGFKLDLTLDGYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 1646 

Query: 66 IX5FSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPIMYK6LPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 1647 LGFSTGIDFWEATGMFAERDGYVFKKAAARAPPGEQFKHLIPLMSRGQKWDWRIRIVQ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGREVVCSVCTKRATCFNSRTGYYGCWR 1766 

Query: 186 HSVGFDYVYNPFtCEDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CP K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 1826 

Query: 246 RV13WSVEYPIIGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
V+W++BYPII +B+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSCRLLQRVMFRAAMLCNRYDVCYDIGNPKGLACVKG--YDF 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDA P +++ Y Y H D+F IX3+C+FWNOiIVD+YPANAW(3lFDTRVL+ 

Sbjct: 1885 KFYDASPV---VKSVKQFVYKYEAHKIQFLDGL(3!FWNaiVDKYPANAWaiFDTRVIiNK 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 

Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMEGMESKQVDYVPL 2001 

Query: 426 KSATCITROHiGGAVCHHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
+SATCITRCNLGGAVC HA EYR+YL++YN . +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYNTATTAGFTFWVYKTFDFYNLWNTFTRLQS 2061 

Ques^: 486 L 486 
L 

Sbjct: 2062 L 2062 
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>gi|l7529672|gb|AAL40397.l|AF220295_2 RNA polymerase lb [bovine 
coronavirus] 

Length = 2685 

Score = 623 bits (1607), Esqpect = e-177 

Identities = 282/481 (58%), Positives = 365/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++Gra PLQ 
Sbjct: 1574 VTYSKLISLMGFKLDVTljDGYCaaiFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 1633 

Query: 66 LGFSTGVNLVAVPTGYVI)TENNTEFTRVNAia>PPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 1634 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 1693 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
M +D L LSD W V WA FELT ++YF K+G E +C + KRAT +++ + Y CW 
Sbjct: 1694 MFADHLIDLSDCWLVTWAANFELTCLRYFAKVGREISCNVSTKRATAYNSRTGYYGCWR 1753 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 1754 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCPCN 1813 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 1814 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV— KDFDF 1871 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365' 
KFYDAQP ++ L Y + H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct:. 1872 KFYDAQPI VK5VKTLLYFFEAHKDSFKDGLCMFWNCNVDKYPPNAVVCRFDTRVLNN 1928 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
•LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 

Sbjct: 1929. LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQVDYVPL 1988 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQS 485 
KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 1989 KSATCITRCmjGGAVCLKHAEEYREYLESYIWATTAGFTFW\nfKTFDFYNLWNTFTKLQS 2048 

Query: 486 L 486 
L 

Sbjct: 2049 L 2049 



>gi 1 251215711 ref I NP_740618.ll coronavirus nspll [Murine hepatitis virus] 
Length = 521 

Score = 622 bits (1603), Bqject = e-177 

Identities = 284/479 (59%), Positives =-362/479 (75%), Gaps = 5/479 (1%) 

Query: 6 MTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQ 65 
+TY RLIS•^MGFK++ +-^GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDLTLIXSYCKLFITRDEAIKRVRAWVGFDAEGAHAIRDSIGTNFPLQ 107 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMyKGLPWNWRIKIVQ 125 
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LGFSTG++ V TG + P + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 108 ]U3FSTGn)FVVBATGMFAEraX5YVFKKAAARAPPGEQFKHLIPLMSRGQKWDVVRIRIVQ 167 

Query: 126 MLSIMliK6LSDRVVFVLWAHGFELTSMKYFVKIGPERTCCIXn)KRATCFSTSSDTYACWN 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 168 MiSDHLaDLM)SVVLVTWAASFELTa:iRYFAKVGREWCSVCTKRATCFNSRTGYYGCWR 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNIiQSNHDQHCQVHGNAHVASCTAIMTRCLAVHECFVK 245 
HS DY+YNP ++IHQQWG+TG+L SNHD C VH AHVAS DAHTTRCLAVH+CF K 
Sbjct: 228 HSYSCTYLYNPLIWIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMTRCLAVHDCFCK 287 

Query: 246 RVDWSVEYPIIGDELRWSAO^QHMVVKSALLADKFPVLHDICTIPKAIKCVPQAEVEW 305 
V+W++EYPII +K+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 288 SVNW^^iEYPIISNEVSVNTSaU^QRVMFRAAMLCNRYDVC^YDIGNPKGLACVKG--YDF 345 

Query: 306 KFYDAQPCSDKAYKIEBLFYSYAIHHDKFTDGVCLFVmaAroRYPMAirVC^^FDTRVLSN 365 
KFYDA P . +++ YY H IHF IX3+C+FVmCMVIHYPANA+VaOT)TRVL+ 

Sbjct: 346 KFYDASPV---VKSVKQF\nfKYEAHKDQFLDGLCMFWNCNVDKYPANAVVCRFDTRVLNK 402 

Query: 366 LNLPGCDGGSLYVNKHaFHTPAPDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PPFYYSD+PC +DYVPL 

Sbjct: 403 LNLPGOIGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSDTPCVYMECaiESKQVDYVPL 462 

Query: 426 KSATCITRCmGGAVCTlHHANEYRQYIjDAYNMMISAGFSLWIYKQFDTYNLWNTFTRIjQ 484 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQ 
Sbjct: 463 RSATCITRQHiGGAVCLKHAEEYREYLESYOTATTAGFTFWVYKTFDFYNLWNTFTRLQ 521 



>gi(26008092|ref |NP_742140.1| coronavirus nspll [Bovine coronavirus] 
Length = 521 

Score = 617 bits (1590), Expect = e-175 

Identities = 282/479 (58%), Positives = 365/479 (76%), Gaps = 5/479 (1%) 

Query: 6 MTYRRLISMMGFKMNYQWGYPNMFITREEAIRHTOAWIGFDVEGCamTRDAVGTN^ 65 
+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDVTLDGYCKLFITKEEAVKRVRAWVGFDAEGAHATRDSIGTNFPLQ 107 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQ 125 
LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 108 LGFSTGIDFWEATGLFADRDGYSFKKAVAKAPPGEQFKHLIPLMTRGQRWDWRPRIVQ 167 

Query: 126 MLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWN 185 
M +D L LSD W V WA PELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 168 MFADHLIDLSDCWLVTWAANFELTCLRYFAKVGREISCNVCTKRATAYNSRTGYYGCWR 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGPTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVK 245 
HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIlCrRCLAV++CF 
Sbjct: 228 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSDAIMTRCLAVYDCFCN 287 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 305 
++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 288 NINWNVEYPIISNELSINTSCRVLQRVMLKAAMLCNRYTLCYDIGNPKAIACV— KDFDF 345 

Query: 306 KFYDAQPCSDKAYKIEBLFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSN 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 
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Sbjct: 346 KFYDAQPI---VKSVKTIiYSFEAffia)SFKIX31jayiFWNaM)KypPNAWCRFD^^ 402 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVPL 425 
LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 

Sbjct: 403 LNLPGaSPGGSLYVNKHAFHTKPFSRAAFEHLKPMPFFYYSDTPCVYMDGMDAKQTO^ 462 

Query: 426 KSATCITRCMjGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFiyiyNLWNTFTRLQ 484 
KSATCITROJLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQ 
Sbjct: 463 KSATCITRCmXSGAVCLKHAEEYKmiESYOTATTAGFTFWVYKTFDFYMiWNTFTKL 521 

>gi|l0242469|ref |NP_066134.1| ORFlab polyprotein; frameshift product [Avian 
infectious bronchitis 
virus] 

Length = 6629 

Score = 575 bits- (1482), E;q>ect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DOTYERLISmGFKMNYQWGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 
++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 5515 EITYKHLISLLGFKMSVNVEGCHNMPITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 5574 

Query: 65 QIX3FSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 
Q+GFSTG + V P G VDT F VN+K. PPG+QF HL Ti PW+V+R +IV 
Sbjct: 5575 QVGFSTGADFWTPEGLVDTSIGNNFEFVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 5634 

Query: 125 QM.SDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD WFV W HG‘ ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 5635 QMIADNLOJVSIXirVVFVTWaiGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 5693 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTG^SNHDQHCQVHGNAHVASCDAIMrRCIiAVHECFV 244 
H +GFD+VYNP ++D+QQW6++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 5694 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRCLAIMNAFC 5753 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304 
+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDVNWDLTYPHIAira)E\mSSatYIK2RMYLNACVDAIiia/NVVYDIGNPKGIKCVRR6DVN 5813 

Query: 305 VKFYDAQPCSDKAYKIEELFYSYAIHHDKFTIXJVa^FWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 5814 FRFYDKNPIVRNVKQFE YDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRYDTRNLS 5870 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSrcESHGKQWSDIDYVP 424 
NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 5871 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 5929 

Query: 425 LKSATCITRCinjGGAVCRHHANEYRQYIiDAYNMMISAGFSLWrYKQFDTYNLWNTFTRLQ 484 
L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 LATKrciTKOnGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ 5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 




wo 2004/092360 PCT/US2004/011710 

169/193 

>gi|l4149033|aDab|CAC39112.1| replicase polyprotein lab [Avian infectious 
bronchitis virus (strain 
Beaudette CK) ] 

Length = 6629 

Score = 575 bits (1482) , Esqpect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

(Juery: 5 IMTYIWLISMMGFKMNYQWGYPNMFITRBEMRHVRAraGFDVEGamTRDAVGTlSr^ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 5515 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 5574 

Query: 65 QI/3FSTGVNLYAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPVJNVVRIKIV 124 
Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct; 5575 QVGFSTGADFWTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 5634 

Query; 125 QMLSOTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct; 5635 QMIJH)NLCWSDCVVFVTWCHGLELTTIiRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 5693 

Query: 185 NHSVGFDYVYNPFMIDVQQ/JGFTGNLQSNHDQHCQVHGNAHVASCDAIMrRCLAVHECFV 244 
H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAHfTRCLA++ F 
Sbjct: 5694 I®CLGFDFVYNPLLVDIQ0(JGYSGNIiQFmroLHaNVHGHAHVASVDAIl(EPRCLAINNAFC 5753 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMWl^ALLADKFPVLHDIGNPKAIKCVPQAEVE 304 
+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDIGNPKGIKCVRRGDVN 5813 

Query: 305 WK5YDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 5814 FRFYDKNPIVRNVKQFE YDYNQHKDKFADGLCMFWNCIMXJYPDNSLVOTYDTRNLS 5870 

Query: 365 • NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 
NLPGC+GGSLYVNKHAF+TP FIH- +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 5871 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 5929 

Query: 425 LKSATCITRCmGGAVamHANEYRQYLDAYl®lMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 
L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 lATKTCITKOJIGGAVCaaamQMyAEFVTSYNAAOTAGFTFWVTNKLNPYNLWKSFSALQ 5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 



>gi|458735|emb|CAA83018.l| potential chimeric protein [Avian infectious 
bronchitis virus] 

Length = 2155 

Score = 570 bits (1470), Ej 5 )ect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 IMTYRRLISMMGFKMNYQVNGYPNMPITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 
++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1596 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 1655 




wo 2004/092360 



170/193 



PCT/US2004/011710 



Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIV 124 
Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 1656 QV6FSTGADFWTPKSLVDTSI6NNFBPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 1715 

Query: 125 QMLSDTLKGLSDRVVFVLWMGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTyACW 184 
QML+D L +SD WFV W HG ELT+++YFVKIG B+ C C RAT F++ + YACW 
Sbjct: 1716 QMIJ^NLCNVSDCVVFVTWCHGIiBLTTLRYFVKIGKBQVCS-CGSRATTFNSHTQAYACW 1774 

Query: 185 lMSVGFDYVYNPFMIDVQQWGFTG^^^QSNHDQHCQVH6NAHVASCE)AIMTRCIlAVHECFV 244 
H +GFD+VYNP ++D+QWG++GNLQ NHD HC VHG+AHVAS DAIMTRCTA++ F 
Sbjct: 1775 KHajGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMrRCLAINNAFC 1834 

Query: 245 KRVOTSVBYPIIGDELRVNSACJUCVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQABVB 304 
+ V+W + YP I +E VNS+CR +Q M + + + A K V++DI6NPK IKCV + +V 
Sbjct: 1835 QIJVNWDLTYPHIANEDEVNSSCRYLQRMYIiNACVDALKVNVVYDIGNPKGIKCVRRGDVN 1894 

Query: 305 WKFYDAQPCSDKAYKIEELFySYAIHHDKFTDGVCLFWNCiNVDRYPANAIVCRFDTRVLS 364- 
++FYD P + E Y Y H DKF DG+G+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 1895 FRFYDKNPIVRNVKQFE YDYNQHIODKFAIXSLCMFWNCNVDCYPDNSLVCRYDTRNLS 1951 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 
NLPGC4GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 1952 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 2010 

Query; 425 IJCSATCITRCOTiGGAVCRHHANEYRQYLDAYNMMISAGFSLVniYKQFDTYNLWNTFTRLQ' 484 
L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 2011 LATKIX:iTKanGGAVCKKHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ 2070 

Query: 485 SL 486 
S+ 

Sbjct: 2071 SI 2072 

>gi 133594 1 sp P26314|RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (ORFlB) 
gi 74826 |pir |VFIHB2 genome polyprotein - avian infectious bronchitis 
virus (strain 

Beaudette) 

gi 292953 1 gb AAA70234.1 pol protein [Avian infectious bronchitis virus] 
gi 331173 1 gb AAA46224.1 ORFlb [Avian infectious bronchitis virus] 

Length = 2652 

Score = 570 bits (1469) , Ejqpect = e-161 

Identities = 262/482 (54%) , Positives = 344/482 (71%) , Gaps = 5/482 (1%) 

Query; 5 IMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 
++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 BITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 1597 

Query: 65 QLGFSTGVNLYAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIV 124 
Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 1598 QVGFSTGADFWTPEGLVDTSIGNNFBPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 1657 

Query: 125 QMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C FAT F++ + YACW 
Sbjct; 1658 QMLADNLCNVSDCWFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 1716 
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Query: 185 IfflSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGaaMIVASCTAIMTRCIAVHECFV 244 
H +GFEH-VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCI 1 A++ F 
Sbjct: 1717 KHOiGFDFVYNPLLVDIQQMSYSCaaiiQFNHDLHCNVHGHAHVASVDAIimiajAINNAFC 1776 

Query: 245 KRVIWSVEYPIIGDELRVNSACRKVQHMVVKSAIiLADKFPVLHDIGNPKAIKCVPQAEVE 304 
+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1777 QE^/IMJLTYPHIANEDEVNSSCRyLQRMYIiNACTOALKVNVVYDIGNPKGIKCVRRGDVN 1836 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + B Y Y H DKF DGvC+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 1837 FRFYDKNPIVRNVKQPE YDYNQHIODKFAIXSLCMFWNajVDCYPDNSLVaiYDTRNLS 1893 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 
NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 1894 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 1952 

Query: 425 LKSATCITROJLGGAVCnUJHaNEYRQYLDAYNMMISAGPSLWIYKQFOTYNLWNTFTRLQ 484 
L + CIT+CN+GGAVC-1- HA Y +++ +YN ++AGP+ W+ + + YNLW +F+ LQ 
Sbjct: 1953 LATKDCITKCNIGGAVCKKHAQMyAEPVTSYNAAVTAGFTFWVTOKIiNPYNLWKSFSALQ 2012 

Query: 485 SL 486 
S+ 

Sbjct: 2013 SI 2014. 



>gi|29293454|gb|AAO67706.l| ORFlb polyprotein [Avian infectious bronchitis 
virus] 

Length = 2649 

Score = 565 bits (1455), Ejqpect = e-160 

Identities = 261/482 (54%), Positives = 342/482 (70%), Gaps = 8/482 (1%0 

Query: 5 IMrYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPL 64 
++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPF 1597 

Query: 65 QLGFSTGVNLVAVPT6YVDTBNNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIV 124 
Q+GFSTG + V P G +DT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 1598 QVGFSTGADFWTPEGLIDTSIGNNFEPVNSKAPPGEQFNHLRALFKSAKPWHVIRPRIV 1657 

Query: 125 QMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADinyaJVSDCVVFVTWCHGLELTTIiRYFVKIGKEQVCS-CGSRATTPNSHTQAYACW 1716 

Query: 185 NHSVGFDYVWFMIDVQQWGFTGOTiQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFV 244 
H +G VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DA+MTRCLA++ F 
Sbjct: 1717 RHCLG VYNPLLVDIQQWGYSCmQPNHDLHCNVHGHAHVASADAVMrRCLAINNAFC 1773 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVE 304 
K V+W ++YP I +E VNS+CR +Q M + + + A K V++DIOJPK IKCV + +V 
Sbjct: 1774 KDVNWELQYPHIANEDEVNSSCRYLQRMYLNACVDALKVNWYDIGNPKGIKCVRRGDVN 1833 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364 
++FYD P + E Y Y+ H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 1834 FRFYDKNPIVPNVKQFE YDYSQHKDKFADGLCMFWNCNVDCYPENSLVCRYDTRNLS 1890 
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Query: 365 NIMjPGaXSGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 
NLPGC+GGSLYVNKHAFHTP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 1891 VFNLPG( 3 IGGSLYVNKHAFHTPKFDRISFIUniKAMPFFFYDSSPCETIQVDGVAQ-DLVS 1949 

Query: 425 IJCSATCITRCmGGAVCRHHANEYRQYLDAYNMIGSAGPSLWIYKQFDTYNLWN^^ 484 
L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ F+ YNLW F+ LQ 
Sbjct: 1950 lATKIXIITKCOTGGA^raaaftQMYAEFVFSYNAAVTAGFTFWVTNNFNPYNLWKNFSAIiQ 2009 

Query: 485 SL 486 
S+ 

Sbjct: 2010 SI 2011 

>gi|25121555|ref |NP_740631.1| coronavirus nspll [Avian infectious 
bronchitis virus] 

Length =521 

Score = 559 bits (1440), Expect = e-158 

Identities = 261/480 (54%), Positives = 342/480 (71%), Gaps = 5/480 (1%) 

\ 

Query: 5 IMZYlUU^ISMlfGFKMNYQWGYPNMFITREEALlIfflVRAWIGFDVEGCHATRDAVGTNLPL 64 
++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 47 EITYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNVRGVJVGFDVEATHACGTNIGTNLPF 106 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLimCGLPWNVVRIKIV 124 
Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 107 QVGFSTGADFWTPEGLVDTSIGNNPEPVNSKAPPGEQFNHLRVLFKSAKPWHVIRPRIV 166 

Query: 125 QMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACW 184 
QML+D L +SD WFV W HG ELT+++YFVKIG E^ C C RAT F++ + YACW 
Sbjct: 167 QMLADNLCNVSDCWFVTWCHGLELTTLRYFVKIGKEQVCS-CGSRATTFNSHTQAYACW 225 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFT®n.QSNHDQHCQVHGNAHVASCI)AimRCLAVHECFV 244^ 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 226 KHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASVDAIMTRC3JVINNAFC 285 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDICajPKAIKCVPQAEVE 304 
+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 286 QDVNVTOLTYPHIANEDEVNSSCRYIiQRMyLNACVDALKVNVVYDIGNPKGIKCVRRGDVN 345 

Query: 305 WKFYDAQPCSDKAYKEEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLS 364- 
++FYD P + B Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 346 FRFYDKNPIVRNVKQFB TOYNQHKDKFADGLCMFWNCllA/I)CYPDNSLVCRYDTROT.S 402 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDYVP 424 
NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 403 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSSPCETIQVDGVAQ-DLVS 461 

Query: 425 LKSATCITRCMiGGAVafflHAHBYRQYLnAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQ 484 
L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 462 LATKDCITKCMEGGAVOaCHAQMYAEFVTSYNAAVTAGFTFWVTNKLNPYNLWKSFSALQ 521 

>gi|9635157 |ref |NP_058422.11 replicase [Transmissible gastroenteritis 
virus] 

gi|7801348|emb|CAB91143.l| replicase [Transmissible gastroenteritis virus] 
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Length = 6685 

Score = 545 bits (1403), E35>ect = e-153 

Identities = 261/484 (53%), Positives = 335/484 (69%), Gaps = 13/484 (2%) 

Query: 4 KDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGIlttiP 63 
KD+ Y +IS MGF+ + GY +F TR+ A+R+VRAW+GFDVEG H D VGTN+P 
Sbjct: 5574 KDWANVISYMGFRFEANIPGYHlIiFCna)FAMI»IVRAWIiGFDVE6AHVCGDNVGTOVP 5633 

Query: 64 LQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKI 123 
LQLGFS GV+ V G V TB V A+ PPG+QF HLIPLM KG PW++VR +I 

Sbjct: 5634 LQLGFSNGVDFWQTEGCVITEKGNSIEWKARAPPGEQFAHLIPLMRKGQPWHIVRRRI 5693 

Query: 124 VQMLSOTLKGLSDRVVFVLWAHGFELTSMKYFVKIG-PERTCCLCDKRATCFSTSSDTYA 182 
VQM+ D GliSD ++FVLWA G ELT+M+YFVKIG P+f C C K ATC+S+S YA 
Sbjct: 5694 VQMVCDYFDGLSDILIFVLWAGGLELTTMRYFVKIGRPQK— CECGKSATCYSSSQSVYA 5751 

Query: 183 OmraSVGFDYVYNPFMIDVQQWGFTGNLQSNIffiQHCQVHGNAHVASCDAIMTRCLAVHEC 242 
C+ H++G DY+YNP+ ID+QQWG+TG+L NH + C +H N HVAS DAIMTRCLA+H+C 
Sbjct: 5752 CS’KHAI/SOJYLYNPYCIDIQQWGYTGSLSMOTfflEVCMrHRiraiVASGimiMTRCUU:^ 5811 

Query: 243 FVKRVrWSVEYPIIGDELRWSA(3aCVQHMWKSAIiLADKFPVLHDIGNPKAIKCVPQAE 302 
FVKRVDWS+ YP I +E ++N A R VQ V+K+AL +HD+GNPK I+C 

Sbjct: 5812 FVKRVDWSIVYPFIDNEEKINKAGRIVQSHVMKAALKIFNPAAIHDVGNPKGIRCA-TTP 5870 

Query: 303 VEWKFYDAQPCSDKAYKIEELFYSYAIHHDKFTIXSVajFWNCNVDRYPANAIVCRFDTRV 362 
+ WYD P++ +LYY+H +G+ LFWNCNVD YP +IVCRFDTR 
Sbjct: 5871 IPWFCYDRDPINN— NVRCLDYDYMVHGQ— MNGLMLFWNCNVDMYPEFSIVGRFDTRT 5925 

Query: 363 LSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSDIDY 422 
S L+L GC+GG+LYVN HAFHTPA+D+ AF LK +PFFYY DS CE V +Y 
Sbjct: 5926 RSKLSLEG<3IGGALYVNlflIAFHTPAYDRRAFAKLKPMPFFYYDDSNCE LVDGQPNY 5981 

Query: 423 WLKSATCITR(mX3GAV(3dfflftllEYRQYIiDAYNMMISA6FSLWIYKQFDTYNLWNTFTR 482 
VPLKS CIT+CN+GGAVC+ HA' YR Y++ YN-f + AGF++W + FDTY. LW+ F 
Sbjct: 5982 VPLKSNVCITKCNIGGAVCKKHAALYRAYVEDYNIFMQAGFTIWCPQNFDTYMLVffiGFVN 6041 

Query: 483 LQSL 486 
++L 

Sbjct: 6042 SKAL 6045 



>gi 19387582 ref |NP_598309.ll Poll [porcine epidemic diarrhea virus] 

gi 13752450 gb|AAK38661.ll Poll [porcine epidenic diarrhea virus] 

Length = 6781 

Score = 541 bits (1394), Expect = e-152 

Identities = 256/480 (53%), Positives = 334/480 (69%), Gaps = 12/480 (2%) 

Query: 8 YRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLG 67 
Y +IS MGF+ + + + +F TR+ A+R+VR W+GFDVEG H VGTN+PLQLG 

Sbjct: 5675 YEHVISFMGFRFDINIPNHHTLFCrRDFAMRNVRGWLGFDVEGAHVVGSNVGTNVPLQLG- 5734 

Query: 68 FST6VOT.VAVPTGYVDTENNTBFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQML 127 
FS GV+ V P G V TE+ V A+ PPG+QF HL+PL+ +G PW+WR +IVQM 

Sbjct: 5735 FSNGVDFWRPBGCWTESGDYIKPVRARAPPGEQFAHLLPLLKRGQPWDVVRKRIVQMC 5794 
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Query: 128 SDTLKGLSDRVVFVLWAHGPBLTSICCYFVKIGPERTCCTA:I)KRATCFSTSSDTYACWNHS 187 
SD L LSD ++FVLWA G BLT+M+YFVKIGP ++C C K ATC++++ TY C+ H+ 
Sbjct: 5795 SDYLANLSDILIFVLmGGLELTTMRYFVKIGPSKSCD-CGKVATCYNSALHTYCCFKHA 5853 

Query: 188 VGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCmiMTRCLAVHECFVKRV 247 
+G DY+YNP+ ID+QQWG+ G+L NH +HC VH N HVAS DAIMTRCLA+H+CFVK V 
Sbjct: 5854 im)YLYNPYCIDIQ(SfGYKGSLSLNHHEHCNVHRNEHVASGDAIiraiCLAIHDCFVKNV 5913 

Query: 248 DWSWPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEWKF 307 
DWS+ YP IG+E +N + R VQ ++S L ++DIGNPK I+C + +W 

Sbjct: 5914 DWSITYPFIGNEAVINKSGRIVQSHTMRSVLKLYNPKAIYDIGNPKGIRCA-VTDAKWFC 5972 

Query: 308 YDAQPCSDKAYICTEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSNLN 367 
+D P + +E Y Y I H +F DG+CLFWNCNVD YP ++VCRFDTR S LN 
Sbjct: 5973 FDKNPTNSNVKTLB YDY-ITHGQF-DGLCLFWNCNVDMYPEFSVVCRFDTRCRSPIiN 6027 

Query: 368 LPGCDGGSLYVNKHAFHTPAFDKSAPTmKQLPFFYYSDSPCESHGKQWSDIDYVPLKS 427 
L GC+GGSLYVN HAFHTPAFDK AF LK +PFF+Y D+ C+ ++ I+YVPL++ 
Sbjct: 6028 LEGCNGGSLYVNNHAFHTPAFDKRAFAKLKPMPFFFYDDTECD KLQDSINYVPLRA 6083 

Query: 428 ATCITROHiGGAVCRHHANBYRQYLDAYNMMISAGFSLWrYKQFDTYNLWNTFT-RLQSL 486 
+ CIT+CN+GGAVC H Y Y++AYN SAGF++W+ FDTYNLW TF+ LQ L 
Sbjct: 6084 SNCITKOJVGGAVCSKHCAMYHSYVNAYNTFTSAGFTIWVPTSFDTYNIiWQTFSNNLQGL 6143 



>gi|l2175747|ref | np_ 073549.1| replicase polyprotein lab [Human coronavirus 
229E] 

gi|l2082740|gb|AAG48591.l|AF304460_2 replicase polyprotein lab [Human 
coronavirus 229E] : 

Length =6758 

Score = 535 bits (1379), Expect = e-151 

Identities = 254/478 (53%), Positives = 329/478 (68%), Gaps = 13/478 (2%) 

(Juery: 7 IYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATEDAVGTNLPLQL 66 
TY +IS MGF+ + + G ++F TR+ A+RHVR W+G DVEG H T D VGTN+PLQ+ 

Sbjct: 5642 TYEHVISYMGFRFDVSMPGSHSLFCTRDFAMRHVRGWLGMDVEGAHVTGDNVGTNVPLQV 5701 

Query: 67 GFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQM 126 
GFS GV+ VAPGVT + VA+ PPG+QF H++PL+ KG PW+V+R +IVQM 
Sbjct: 5702 GFSNGVDFVAQPBGCVLTNTGSWKPVRARAPPGEQFTHIVPLLRKGQPWSVLRKRIVQM 5761 

(Juery: 127 LSDTUtGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNH 186 
++D L G SD +VFVLWA G ELT+M+YFVKIG + C C ATC+++ S+ Y C+ H 
Sbjct: 5762 lADFLAGSSDVLVFVLWAGGLELTTMRYFVKIGAVKH-CQCGTVATCYWSVSNDYCCFKH 5820 

Query: 187 SVGFDYVYNPFMIDVQQWGPTGNLQSNHDQHCQVHGNAHVASCDAIMrRCLAVHECFVKR 246 
++G DYVYNP++IEH-QQWG+ G+L +NH C VH N HVAS DAIMTRCLAV++CFVK 
Sbjct: 5821 ALGCDYVYNPYVIDIQQWGYVGSLSTNHHAICNVHRNEHVASGDAIMTRCLAVYDCFVKN 5880 

(Juery: 247 VDWSVEYPIIGDELRVNSAC3UCVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEWK 306 
VDWS+ YP+I +B +N R VQ ++++A+ +HDIGNPK I+C + +W 

Sbjct: 5881 VDWSITYPMIANENAINKGGRTVQSHIMRAAIKLYNPKAIHDIGNPKGIRCA-VTDAKWY 5939 

(2uery: 307 FYDAQPCSDKAYKIEELFYSYAIHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVLSNL 366 
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YD P + +E Y Y H DG+CLFWNCayiVD YP +IVCRFDTR S L 
Sbjct: 5940 CYDKNPINSNVKTLE YDYIOTIGQ--MDGLCI 1 FMCNVIMYPEFSIVCRFDTRTRSTL 5994 

Query: 367 NLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQWSD-IDYVPL 425 
NL G +GGSLYVN HAFHTPA+DK A LK PFFYY D CE W D ++YVPL 
Sbjct; 5995 NLEGWGGSLYVNNHAFHTPAYDKRAMAKLKPAPFFYYDDGSCE WHDQVNYVPL 6049 

Query: 426 KSATCITROJLGGAVCIUfflANEYRQYIiDAYNMMISAGFSLWIYKQFDTYNLVaNTFTRL 483 
++ CIT+CN+GGAVC HAN YR Y+++YN+ AGF++W+ FD YNLW TFT + 

Sbjct: 6050 RATNCITKCNIGGAVCSKHANLYRAYVESYNIFTQAGFNIWVPTTFDCYNLWQTFTEV 6107 



>gi 133591 1 sp P18458|RRPB_BEV RNA-directed RNA polymerase (ORFIB) ' 
gi 94017 |pir |S11238 polymerase - Berne virus 

gi 1334814 |emblCAA36601.1 I 2nd polymerase reading frame (AA 1-2291) [Berne 
virus] 

Length =2291 

Score = 50.1 bits (118), Expect = 8e-05 

Identities- = 37/103 (35%) , Positives = 54/103 (52%) , Gaps = 11/103 (10%) 

Query: 140 FVLVmHGFELTSMKYFVKIGPERTC--CIXn)KRATCFSTSSDTYACWNHSV6F--DYVYN 195 
F+L+-I- +L S+K++V+ TCCC+AC +YCN G +N 
Sbjct; 1511 FIIiYSCSNDLKSLKFYVEFD TCYFCSCGEMAIOiMRDGN-YKCRNCYGGMLISKLVN 1566 

Query: 196 PFMIDVQQWGFTGNLQSNHDQHC-QVHGNAHVASCDAIMTRCL 237 
-^DVQ+ LQ HD C Q HG++H A CDA+MT+CL 

Sbjct: 1567 CKYLDVQKERV— KLQDAHDAICQQFHGDSHEALCDAVMTKCL 1607 

>gi|l51306l|dbj |baa 13323.1| cyanoprotein alpha stibunit precursor [Riptortus 
clavatus] 

Length = 693 

Score = 34.7 bits (78), Expect = 3.7 

Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 1/36 (2%) 

Query: 371 CDGGSLYVHKHAFHTPAFDKSAFTNLKQLPFFYYSD 406 
C G LY +KHA P FD+ A+ + Q+P FY+ D 
Sbjct: 643 CGGSKLYDSKHAMGFP-FDRPAYPDAFQVPNFYFKD 677 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 
Posted date: J^r 11, 2003 2:30 AM 

Niamber of letters in database: 454,141,287 
Number of sequences in database: 1,411,415 

Lambda K H 

0.325 0.139 0.456 

Gapped 

Lambda K H 

0.267 0.0410 0.140 
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Matrix: BL0SUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 473,361,261 

Number of Sequences: 1411415 

Number of extensions: 20503315 

Number of successful extensions: 51018 

Number of sequences better than 10.0: 27 

Number of HSP’s better than 10.0 without gapping: 26 

Number of HSP's successfully gapped in prelim test: 1 

Number of HSP's that attempted gapping in prelim test: 50937 

Number of HSP ' s gapped (non-prelim) : 33 

length of query: 486 

length of database: 454,141,287 

effective HSP length: 127 

effective length of query: 359 

effective length of database: 274,891,582 

effective search space: 98686077938 

effective search space used: 98686077938 

T: 11 

A: 40 

XI: 15 ( 7.0 bits) 

X2: 38 (14.6 bits) 

X3: 64 (24.7 bits) 

SI: ,40 (21.6. bits) 

S2: 75 (33.5 bits) 
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FIGURE 126 



5' 3' Frame 1 

QVHQNVCVL-LIFYIM'LSR--SHKICQ-FQKWSRLQLmKFHSCF6VRMDMIiKPSTQN 

YKQVIOlGNQVIJlCLTCTRaCECFLKSVTFRIMVKMLLYQKE MSQSILNCVNT-IHLL 

-LYPPT-ELFTLVL 

5' 3' Frame 2 

RFIKMCVFCD-SFT— LCRDNKVTRFVSDFKSGQGYN-LC-NFIHALV-GWTC-NLLPKT 
TSKSSVATRCCDA-LVQDAKNAS-KV-PSELW-KCCYTKRNNDECRKVYSTVSILKYTYF 
SCTLQHESYSLWCW 

5' 3* Frame 3 

GSSKCVCSVIDLLLDDFVEIIKSQDLSVISKWKVTIDYAEISFMLWCKDGHVETFYPKL 

QASQAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLNTLTL 

AVPSNMRVIHFGAG 

3 >5' Frame 1 

PAPK-ITLMLEGTAKVSVFKY-HS-VYFATFIIIPFGITAFSP-F-RSHFSRSILCILYK 

LGIATPGCHA-LACSFG-KVSTCPSLHQSMNEISA-SIVTLTTFEITDKSCDFIISTKSS 

SKRSITEHTHFDEP 

3 '5' Frame 2 

QHQSE-LSCWRVQLK-VYLSIDTVEYTLRHSSLFLLV-QHFHHNSEGHTFQEAFFASGTS 

-ASQHLVATLDLLWLGRRFQHVHPYTKA-MKFQHSQL-P-PLLKSLTNLVTLLSRQSHQ 

VKDQSQNTHIIxMNL 

3 >5' Frame 3 

STKVNNSHVGGYS-SKCI-VLTQLSILCDIHHYSFWYNSIFTIILKVTLFKKHSLHLVQV 

RHRNTWLPRLTCL-FWVEGFNMSILTPKHE-NFSIVNCNLDHF-NH-QIL-LYYLDKVIK 

-KINHRTHTF-T 




wo 2004/092360 



179/193 



PCT/US2004/011710 



FIGURE 127 



5 '3' Frame 1 

-WTYPGKMQPRSLVDLFSKRTN-NV--WTPIKPT-CPPHYIWWTHRFN-Q-PEWRTM 

GQGQNSADPKVYPIILIUjGSQLSLSMARRNLDSLEARAFQSTPIWQMTKLATTEELPDE 

FVVVTM-KSSAPIX3TSIT-ELAQKLHFPTALTKKASYGLQLREP-IHPKTTLAPAILIT 

MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHWAVIQEIQLLMVGEILLLEWLAE 

VVKLPSRYCC-TD-TSLRAKFLVKANNimKLSLRNLLLmKSIJyKmPQNSTTSLKH 

LGDWQNKPKEISGTKT-SDKELITNIGPQIAQFA 

5'3' Frame 2 

RFLPTQEKPTNLDLL-ICSLNEQIKMSDNGPQSNQRSAPRITFGGPTDSTDMQNGGLQW 
GKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ-WSR-PNWLLPKSYPTS 
SWW-RQNERAQPQMVLLLPRNWPRSFTSLRR-QRRHRMGCN-GSLEYTQRPHWHPQS— Q 
CCHRATTSSRMIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG-RR 
W-NCPRAIAARQIEPA-EQSFW--RPTTTRPNCH-EICC-GI-KASPKTYCHKTVQRHSSI 
WETWSRTNPRKFRGPRPNQTRN-LQTLGRKLHNLP 

5'3* Frame 3 

GFYLPRKSQPTSISCRSVL-TNKLKCLIMDPNQTNWPPALHLVDPQIQLTITRMEDCNG 

ARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRV 

RGGIX3KMKELSPRWYFYYLGTGPEASLPYGAMEGIVWATEGALNTPKDHIGTROTNNN 

AATVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGG 

GETALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAF 

GRRGPEQTQGNFGDQDLIRQGTDYKHWAANCTIC 

3 '5* Frame 1 

RQIVQFAAQCL-SVPCLIRSWSPKFPWVCSGPRLPNA-VTLYCFVAVRFWRGFLDASAAD 
FLVTVWPCCCWPLPETLLSSWFNLSSSNSARAVSPPPLAIRAGEFPLLLPGVEFLELPRL 
RDEEREEA-LPPLLPSA-KPFGNWP-GSCSTVAALLLGLRVPMWSLGVFKAPSVATHTM 
PSLLAP-GSEASGPVPR— KYHLGLSSFILPSPPRTRRVALR— PIWSSGPLLVLIGTPW 
PRGNLSSSLPC-VRAVNQDAILLGKPWGRRCFGLAPLQSSILVIVS-ICGSTKCNAGGTT 
LV-LGSIIRHFNLFV-RTDLQEIEVGWLFLGR-KP 

3 >5' Frame 2 



GKLCNLRPNVCNQFLV-LGLGPRNFLGFVLDHVSQMLE-RCTVLWQYVFGEAF-MPQQQI 
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S— QFGLVWGLYQKLCSQAGSICLMIARGQFHHLR-PFEQENFPYCCQELNFLNYRDY 
VMSEmiDCRLCFPLRRSLLAMLFLEEVVMWQHCY-DCGCQCGLWVYSRLPQLQPIRC 
LLC-RRREVKLLGQFLGMSTIWG-ALSFCRIfflHELVG-LFGSSQFGHLDHYWC-LERPG 
LEGI-VPPCHAE-EL-TKTQYYWVNLGVGAVLALPHCSPPFWLLSVESVGPPNVMRGALR 
WFDWGPLSDliilCSFREQIYKRSRLVGFSWVGKNL 

3>5> Frame 3 

ANCAICGPMFVISSLSD-VLVPEISLGLFWTTSPKCLSDWLFCGSTFLARLFRCLSSRF 
LSDSLALLLLAFTRNFALKLVQSV-QQ-REGSFTTSASHSSRRISPTAARS-IS-ITATT 
— GARRGLTAASASLCVEAFWQCCSLRKL-HGGSIVIRIAGANWFGCIQGSLSCNPYDA 
FFVSAVGK-SFWASS-VIEVPSGAELFHFAVTTTNSSGSSSWANLVIWTTIGVDWNALA 
SRESKFLLAMLSESCEPRRNIIG-TLGSALFWPCPIAVLHSGYCQLNLWVHQM-CGGHYV 
GLIGVHYQTF-FVRLENRSTRDRGWLAFPG-VKT 



FIGURE 128 

-GLELKL-LTSICAF-PFCYSLF— CLLYFGFHSKSRI-KNLVPKSKRT-NFSLF-LVFL 
YAVAYAL-YSAmiNLMCLKILVRYM’RGNTYSTAWLCALGKVLPFHRWHTMVQTCTPN 
WINCQDPAGGALIARCmLHEGHQTAAFRDVLVVLNKRTN-NV--WTPIKPT-GPPHYI 
WWTHRFN-Q-.PEWRTQWGKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ 
-WSR-PNWLLPKSYPTSSWW-RQNERAQPQMVLLLPRNWPRSFTSLRR-QRRHRMGCN-G 
SLEYTQRPHWHPQS— QCCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKF 
NSWQQ-GKFSCSNG-RRW-NCPRAIAARQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-K 
ASPKTYCHKTVQRHSSIWETWSRTNPRKFRGPRPNQTRN-LQTLAANCTICSKCLCILWN 
VTHWHGSHTFGMADLSWSH-IG-QRSTIQRQRHTAEQAH-RIQNIPTNRA-KGQKEKD- 
-SSAFAAETKEAAHCDSSSC 

EDSSSSFN-LLFVLFSLSAIPCFNNAYYILVFTRNPGSRRTLYQSLNEHETSHCFDLYFS 
MQLHMHCSTALCI— TSCA-RSL-GTTLGVILIALLGFVL-ERFYLFIDGTLWFKHAHLM 
LLSmiQLVm-LGVGTFMKVTKLLHLETYLLF-INEQIKMSDNGPQSNQRSAPRITF 
GGPTDSTDNNQNGGRNGARPKQRRPQGLPNNIASWFTALTQHGKEELRFPRGQGVPINTN 
SGPDDQI6YYRRATRRVRGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEG 
ALNTPKDHIGTRNPNNNAATVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNS 
TPGSSRGNSPARMASGGGETALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKK 
PRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGM 
SRIGMEVTPSGmTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDKKKKTD 
EAQPLPQRQKKQPTVTLLP 
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RTRAQALIDraCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV-TNMKLLIVLTCISL 

CSCICTWQRCASNKPHVLEDPCKVQH-G-YL-HCLALCSRKGFTFS-MAHYGSNMHT-C 

YYQLSRSSWWCAYS-VLVPS-RSPNCCI-.RRTCCFK-TNKLKCLIMDPNQTNVVPPALHL 

VDPQIQLTITRMEDMGQGQNSADPKVYPIILRLGSQLSLSMARRNLDSLEMAFQSTPI 

WQMTKLATTEELPDEFVVVTM-KSSAPDGTSIT-ELAQKIiHFPTALTKKASYGLQIiRE 

P-IHPKTTLAPAILITMLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHWAVIQEIQ 

LLMVGEILLLEWLAEVVKLPSRYCC-TD-TSLRAKFLVKANNNKAKLSLRMi 

LAKNVLPQNSTTSLKHLGDWQNKPKEISGTKT-SDKELITNIGRKLHNLLQVPLHSLEC 

HAMWKSHLREHG-LIMEPLNWMTKIMSKTTSYC-TSTLTHTKHSHQQSLKRTKRKRM 

KLSLCRRDKRSSPL-LFFL 



FIGURE 129 



5>3> Frame 1 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccct 

Y R R L I S M M 6FKMNYQVNGYP 

aatatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgat 

NMFITREEAIRHVRAWIGFD 

gtagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctagga 

V EGCHATRDAVG TN LPLQLG 

ttttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataac 

F S T G V N L V A V P T G Y V D T E N N 

acagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

TEFTRVNAKPPPGDQFK HLI 



5' 3' Frame 2 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttacccta 
TV DSSL-WVSK - ITKSMVTL 
atatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatg 
ICLSPAKKLFVTFVRGLALM 
tagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggat 
-RAVMQLEMLWVLTYLSS-D 
tttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataaca 
FLQVLT--LYRLVMLTLKIT 
cagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 
QNSPELMQNLHQVTSLNILY 
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5>3> Frame 3 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccctaa 
P-THLYDGFQNELP SQWL P- 
tatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatgt 
YVYHPRRSYSSRSCVDWL-C 
agagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggatt 
RGLSCN-RCC6Y-PTSPARI 
ttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataacac 
FYRC-LSSCTDWLC-H-K-H 
agaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 
RIHQS-CK TSTR-PV-TSYT 



3'5* Frame 1 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattct 
G.IRCLNWSPG GGFALTLVNS 
gtgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaa 
VLFSVST-PVGTATKLTPV E 
aatcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctct 
N P S ,W R G R L V P T A S L V A - Q P: S 
acatcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacata 
TSK P IHART-RIASSRVINI 
ttagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 
LG-PLTW-FILKPIIEMSLR 



3 '5' Frame 2 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctg 
V-DV-TGHLVEVLH-LW-IL 
tgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaa 
C Y-F QCQHNQSVQLLS-HL-K 
atcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctcta 
ILAGEVG-YPQHL-LHDSPL 
catcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatat 
HQSQSTHERDE-LLRG--TY 
tagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 
-GNH-LGNSF-NPS-R- VYG 
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3'5* Frame 3 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgt 

YKMFKLVTWWRFCINSGEFC 

gttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaa 

VIFSVNITSRYSY.-VNTCRK 

tcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctac 

S-LER-VSTHSISSCMTALY 

atcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatatt 

IKANPRTNVTNSFFAGDKHI 

agggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

RVTIDL-VIHFETHHRDESTV 

FIGURE 130 

10 20 30 40 50 60 

i I I I . I I, 

SEQ ID NO: 9997 KGHDLRRLISMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVG'IWLP 

SEQ ID NO: 10034 YRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 

******************************************************* 

Prim. Cons. KGHD2RRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 

70 80 90 100 110 120 

I I I I I I 

SEQ ID NO: 9997 IKJIiGFSTGmVAVPTGYVDTENNTKFTRVNAQTSTSEQFKHLIPLMYKGLPWNVVRIKI 

SEQ ID NO: 10034 IKJLGFSTGmVAVPTGYWENNTEPTRVNAKPPPGDQFKHLI 

******************************** ..****** 

Prim. cons. IiQLGFSTGVNLYAVPTGYVDTENNT2FTRVNA222222QFKHLIPLMYKGLPWNWRIKI 

130 140 150 160 170 180 

I I I I I I 

SEQ ID NO: 9997 VQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 
SEQ ID NO: 10034 

Prim. cons. VQM.SDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 

190 200 

I I 

SEQ ID NO: 9997 WNHSVGPDYVYNPFMIDVQQWGLYG 
SEQ ID NO: 10034 
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figure 131 

5*3* Frame 1 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgaga 
QVHQNVCVL-LI FYLMTLSR 
taataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatg 
--SHKICQ-FQKWSRLQLTM 
ctgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaac 
LKFHSCFGVRMDMLKPSTQN 
tacaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaa 
YKQVKRGNQVLRCLTCTRCK 
gaatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaag 
ECFLKSV T FRIMVKMLLYQK 
gaataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttactt 
E---MSQSILNCVNT-IHLL 
tagctgtaccctccaacatgagagttattcactttggtgctgg 
-LYPPT-ELFTLVL 



5*3' Frame 2 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagat 
RFIKMCVFCD-SFT --LCRD 
aataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgc 
NKVT RFVSDF KSGQGYN-LC 
tgaaatttcattpatgctttggtgtaaggatggacatgttgaaaccttctacccaaaact 
-nfi'halv-gwtc^nllpkt 
acaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaag 
TSKS SVATRCCDA-LVQDAK 
aatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagg 
NAS-KV-PSELW-KCCYTKR 
aataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttt 
NNDECRKV YSTVSILKYTYF 
agctgtaccctccaacatgagagttattcactttggtgctgg 
SCTLQHESYSLWCW 



5 * 3 ' Frame 3 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagata 

GSSKCVCSVIDLLLDDFVBI 

ataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgct 

IKSQDLSVISKVVKVTIDYA 

gaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaacta 

EISFMLWCKDGHVETFYPKL 

caagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaaga 
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QASQAWQPGVAMPNLYKMQR 
atgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagga 
M L LEKCDLQNYGENAVIPKG 
ataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttta 
IMMNVAKYTQLCQYLNTLTL 
gctgtaccctccaacatgagagttattcactttggtgctgg 
AVPSNMRVIHFGA 



3'5* Fraioe 1 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaag 
PAPK- ITL M L EGTAKVSVFK 
tattgacacagttgagtatactttgcgacattcatcattattccttttggtataacagca 
Y-HS-VYFA T FIIIPFGITA 
ttttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaag 
FSP-F-RSHFSRSILCILYK 
ttaggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtt 
LGIATPGCHA -LACSFG-KV 
tcaacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacc 
STCPSL HQSMNEISA - S IVT 
ttgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatca 
LTTFEITDKSCDFIISTKSS 
agtaaaagatcaatcacagaacacacacattttgatgaacctg 
SKRSITEHTHFDEP 



3 '5' Frame 2 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagt 
QHQSE-LS CWRVQLK-VYLS 
attgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcat 
IDTVEYTLRHSSLFLLV-QH 
tttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagt 
FHHNSEGHTFQEAFFASCTS 
taggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggttt 
-ASQHLVATLDLLVVLGRRF 
caacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacct 
QHVHPYTKA-MKFQHSQL-P 
tgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaa 
-PLLKSLTNLVTLLSRQSHQ 
gtaaaagatcaatcacagaacacacacattttgatgaacctg 
VKDQSQNTHILMNL 



3 '5' Frame 3 
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ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagta 
STKVNNSHVGGYS-SKCI-V 
ttgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcatt 
LTQLSILCDIHHYSFWYN SI 
ttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagtt 
FTI ILKVTLFKKHSLHLVQV 
aggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtttc 
RH RNTW.L PRLT CL - FWVEG F 
aacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacctt 
NMSILTPKHE-NFSIVNCNL 
gaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaag 
DHF-NH-QIL -L YYL DKVIK 
taaaagatcaatcacagaacacacacattttgatgaacctg 
-KINHRTHTF--T 



FIGURE 132 

5*3 ' Frame 1 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctct 
-VFTYPGKANQPRSLVDLFS 
aaacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccg 
KRTN-NV--WTPIKPT-CPP 
cattacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatg 
HYIWWTHRFN-Q-PEWRTAM 
gggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttca 
GQGQNSADPKVYPIILRLGS 
cagctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaa 
QLSLSMARRNLDSLEARAFQ 
tcaacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgag 
ST PIV VQMT K LATTEELPDE 
ttcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctag 
FVVVTAK-KSS APDGTSIT- 
gaactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttg 
ELAQKLHFPTALTKKASYGL 
caactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataaca 
QLREP-IHPKTTLAPAILIT 
atgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcag 
MLPPCYNFLKEQHCQKASTQ 
agggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaatt 
REAEAAVKPLLAPHHVVAVI 
caagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggag 
QEIQLLAAVGEILLLEWLAE 
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gtggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaag 
VVKLPSRYCC-TD-TSLRAK 
tttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgagg 
F L V K A N NNKAKLSLRNLLLR 
catctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcat 
HLKSLAKNVLPQNSTTSLKH 
ttgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagac 
LGDVVQNKPKEIS6TKT-SD 
aaggaactgattacaaacattgggccgcaaattgcacaatttgcct 
KELITNIGPQIAQFA 



5 '3' Frame 2 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctcta 
RFLPTQEKPTNLDLL-ICSL 
aacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgc 
N. E QIKMSDNGPQSNQRSAPR 
attacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatgg 
ITFGGPTDSTDNN QNGGLQW 
ggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcac 
GKAKTAPTPRFTQ-YCVLVH 
agctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaat 
SSHSAWQGG T - IPSRPGRS.N 
caacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagt 
Q H.Q - WSR-PNWLLPKSYPTS 
tcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagg 
SWW- RQNERAQPQMVLLL PR 
aactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgc 
NWPRSFTSLRR-QRRH RMGC 
aactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaa 
N- GSLEYTQRPHWHPQS--Q 
tgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcaga 
CCHRATTSSRNNIAKRLLRR 
gggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattc 
GKQRRQSSLFSLLIT-SR-F 
aagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggagg 
KKFNSWQQ-GKFSCSNG-RR 
tggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagt 
W-NCPRAIAARQIEPA-EQS 
ttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggc 
F .W-RPTTTRPNCH-EICC-G 
atctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcatt 
I-KASPKTYCHKTVQRHSSI 
tgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagaca 
WETWSRTNPRKFRGPRPNQT 
aggaactgattacaaacattgggccgcaaattgcacaatttgcct 
RN-LQTLGRKLHNLP 



i 
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5*3' Frame 3 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctctaa 
GFYLPRKSQPTSISCRSVL- 
acgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgca 
TNKLKCLIMDPNQTNVVPPA 
ttacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatggg 
LHLVDPQIQLTITRMEDCN G 
gcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcaca 
ARPKQRRPQGLPNNIASWFT 
gctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaatc 
ALTQHGKEELRFPRG QGVPI 
aacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagtt 
NTNSG. PDDQIGYYRRATRR V 
cgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagga 
RGGDG KMKELSPRWYFYYLG 
actggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgca 
TG PEA SLPYGANKEGIVWVA 
actgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaat 
TEGALNTPKDHIGTRNPNNN 
gctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcagag 
AATV LQLPQGTT. LP:KGFYAE 
ggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattca 
GSRGGSQASSRSSSRSRGNS 
agaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggaggt 
RNSTPGSSRGNSPARMASG.G 
ggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagtt 
GETALALLLLDRLNQLESKV 
tctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggca 
SGKGQQQQG.QTVTKKSAAE A 
tctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcattt 
SKKPRQKRTATKQYNVTQAF 
gggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagacaa 
GRRGPEQTQGNFGDQDLIRQ 
ggaactgattacaeiacattgggccgcaaattgcacaatttgcct 
GTDYKHWAANCTIC 



3'5V Frame 1 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtct 

RQIVQFAAQCL-SVPCLIRS 

tggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacg 

WSPKFPWVCSGPRLPNA-VT 

ttgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagat 
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LYCFVAVRFWRGFLDASA AD 
ttcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagc 
FLVTVWPCCCWPLPETLLSS 
tggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccatt 
WFNLSSSNSARAVSPPPLAI 
cgagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgacta 
RAGEFPLLLPGVEFLELPRL 
cgtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcct 
RDEEREEA-LPPLLPSA-KP 
tttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattg 
FGNVVP-GSCSTVAALLLGL 
cgggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatg 
RVPMWSLGVFKAPSVATHTM 
ccttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaag 
PSLLAP-G SEASGP V P R - - K 

taccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagct 
YHLGLSSFILPSPPRTRRVA 
cttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctgg 
L R - - P I w S S.G P L L V L I G T P W 

cctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgca 
PRGNLSSSLPC-VRAVNQDA 
atattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcc 
ILLGKPWGRRCFGLAPLQ SS 
attctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacg 
ILVIVS - ICGST KCNAGGTT 
ttggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatcta 
LV-LGSIIRHFNLFV-RTDL 
caagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 
Q:EIEVGWLFLGR-K.P 



3 '5' Frame 2 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtctt 

GKLCNLRPNVCNQFLV-LGL 

ggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgt 

GPRNFLGFVLDHVSQMLE-R 

tgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagatt 

CTVLWQYVFGEAF-MPQQQI 

tcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagct 

S--QFGLVVVGLYQKLCSQA 

ggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattc 

GSICLAAIARGQFHHLR-PF 

gagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactac 

EQENFPYCCQELNFLNYRDY 

gtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcctt 

VMRSEKRLDCRLCFPLRRSL 

ttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgc 
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LAMLFLEEVVARWQHCY-DC 
gggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgc 
GCQC6LWVYS RLPQLQPIRC 
cttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagt 
LLC-RRREVKLLGQFLGNRS 
accatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctc 
TIWG-ALSFCRHHHELVG-L 
ttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggc 
F G S SQFGHLDHYWC-LERPG 
ctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaa 
LEGI-VPPCHAE-EL-TKTQ 
tattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcca 
YYWVNLGVGAVLALPHCSPP 
. ttctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgt 
F W L L SVESVGPPNVMR G'A L R 
tggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctac 
WFPWGP LSDILICSFRE.QIY 
aagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 
K. R SRLVGFSWVGKNL 



3«5' Frame 3 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtcttg 
•A NCAICG PMFV I S SLSD - ¥L 
gtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgtt 
VPBISLGLFWTTSPKCLSDV 
gtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagattt 
VLFCGSTFLARL FRCL SSRF 
cttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagctg 
LSDSLALLLLAF TRNFALKL 
gttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattcg 
VQSV-QQ-REGS FTTSASHS 
agcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactacg 
SRRISPTAARS-IS-ITATT 
tgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagccttt 
--GARRGLTAASASLCVEAF 
tggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgcg 
WQCCSLRKL-HGGSIVIRIA 
ggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgcc 
GANVVFGCIQGSLS CNPYDA 
ttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagta 
FFVSAVGK-SFWASS-VIEV 
ccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctct 
PSGABL FHFAVTTTNSSGSS 
tcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggcc 
SVVANLVIWTTIGVDWNALA 
tcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaat 
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SRESKFLLAMLSESCEPRRN 
attattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctccat 
IIG-TLGSALFWPC PIAVLH 
tctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgtt 
SGYCQLNLWVHQM-CGGHYV 
ggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctaca 
GLIGVHYQTF-FVRLENRST 
agagatcgaggttggttggcttttcctgggtaggtaaaaaccta 
RDRGWLAFPG-VKT 
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FIGURE 133 
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FIGURE 134 
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